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Introduction 


In a sense, this book is a celebration of the Black-Scholes model. Widely criticized 
for its shortcomings, ever since the dramatic Long Term Capital Management 
meltdown in 1998, this ‘first generation’ model is still the first benchmark in 
financial modelling. May it be the Heston, Stein-Stein, Bergomi, local volatility, 
local-stochastic volatility, Lévy, uncertain volatility or fancy hybrid model: they are 
all perturbations of the Black-Scholes model, typically either making volatility 
stochastic or introducing jumps. If Black-Scholes assumes, say 20-40 % volatility, 
all the above extensions more or less agree with this order of magnitude for overall 
volatility, and indeed the around-the-money volatility smile is plainly a perturbation 
of the flat implied volatility corresponding to Black-Scholes. 

The aforementioned extensions do not come with tractable option price for- 
mulae, since most diffusion processes do not admit closed-form transition densities. 
The second best situation is a closed-form Fourier Transform of the transition 
density, and many of the aforementioned extensions share this property, addition- 
ally to explaining some stylized facts from the dynamics of the implied volatility 
surface. 

An alternative to the Fourier approach is given by asymptotic expansions of 
transition densities of stochastic processes, say in the short-time (or more generally 
small-noise) limit. Such investigations go back to S.R. Srinivasa Varadhan in the 
late 1960s and are intimately connected to his theory of large deviations: it is pretty 
unlikely for a particle starting at some position to diffuse to some other position if 
there is almost no time to do so (or if the driving noise is switched off). The beauty 
of large deviations is to explicitly identify a precise scale rough enough to be 
computable (or at least to be characterized in terms of some variational problem), 
and fine enough to capture the most important leading-order behaviour of the 
system. In the context of transition densities, or heat-kernels in PDE terminology, 
complete expansions have been derived in the 1970s and 1980s, with a bulk of 
geometric information hidden in the coefficients. The Russian school has also been 
fundamental in the development of (sample path) large deviations for stochastic 
processes, in particular through the works of Mark Freidlin and Alexander Wentzell 
in the 1970s. On a historical note, it is interesting to remember that large deviations 
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theory was originally developed (in the finite-dimensional case) by Harald Cramér 
in the 1930s for actuarial mathematics. 

A widely circulated preprint by Patrick Hagan et al. (following the famous 
SABR paper), first presented in 2001 by Andrew Lesniewski at the Courant Finance 
Seminar, intensified the connection between heat-kernels, geometry and finance. 
The resulting SABR formula has become industry standard in fixed income mod- 
elling (and presumably a long-time headache for quants tortured by risk manage- 
ment). The topic was further explored by а number of people including Marco 
Avellaneda, Christian Bayer, Gérard Ben Arous, Jéróme Busca, Jean-Dominique 
Deuschel, Martin Forde, Pierre Henry-Labordére, Elton Hsu, Peter Laurence, 
Cheng Ouyang and many others (including, unsurprisingly, all the editors of this 
volume). 

Despite the undisputed mathematical depth of this development, the agenda has 
been largely initiated by people in or near the industry, a quick publication not 
always being their first priority. This, at least, is our only explanation for the fact 
that some key papers have remained preprints ever since, though widely circulating 
and used for years. We also note that the derivation of closed-form approximation 
formulae in various non-tractable models remains a constant topic in major 
academic and industry meetings alike, not to mention some specialist meetings 
(Vienna 2009, Berlin 2011, London 2013) organized by factions of the present 
group of editors in different constellations. 

The present proceedings grew on this fertile ground. Contributions include some 
unpublished classics (in brushed-up versions), notably the aforementioned preprint 
by Patrick Hagan et al. as well as recent works touching the theme of large devi- 
ations and/or asymptotic expansions in mathematical finance. 

The editors have known each other for a long time. The idea for this book project 
was born in July 2013, but the first step towards realization was overshadowed by a 
sad event: we are still shocked that our esteemed colleague and friend, whom we 
had invited to co-edit this volume, has never received his invitation: Peter Laurence 
passed away unexpectedly in August 2013. 

Peter Laurence was born in New York, NY, on 27 March 1952. After under- 
graduate courses at the Wharton School of Finance and Commerce at the 
University of Pennsylvania, he obtained a Bachelor of Science in Mathematics and 
Philosophy degree in 1973. He also obtained a Master of Science degree (1977) 
and a Ph.D. degree (1981) from the University of Wisconsin Madison. From 
1974—1991, Peter was a faculty member at the University of Wisconsin, the 
Courant Institute of Mathematical Sciences at New York University, Worcester 
Polytechnic Institute, Pennsylvania State University, and the University of Milano, 
Italy. From 1991 till his untimely death in 2013, Peter was a professor at Sapienza 
Università di Roma and a visiting scholar at the Courant Institute. Peter published 
more than 60 research papers, co-authored a book “Quantitative Methods of 
Derivative Securities: From Theory to Practice" with Marco Avellaneda, and was 
one of the editors of the volume “Quantitative Energy Finance: Modeling, Pricing, 
and Hedging in Energy and Commodity Markets". His long-term friend Marco 
Avellaneda remembers: Peter had an infinite joie de vivre... This involved a lot of 
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research in Math Physics, one of his passions. I enjoyed discussing Math Physics 
with him. We also began our interest in finance in the 90s and co-authored [our] 
book. He had a kind heart. May he rest in peace and live in us who are still here. 
Or as Bruno Dupire articulates it: He was a gentleman and will be missed. 

Each of us has stories to tell about Peter and his inexhaustible passion for 
mathematics and its impact on finance. Instead of trying to fit them in this intro- 
duction we rather let him speak through mathematics: some of Peter Laurence's 
final contributions to mathematical finance do appear in these proceedings, with the 
kind agreement of the respective co-authors. 

We are indebted to all the reviewers who helped us achieving this work. It is also 
our pleasure to thank Magdalena Mueller-Laurence, as well as the Springer 
Proceedings team, without whom this book would never have appeared. 


January 2015 Peter K. Friz 
Jim Gatheral 

Archil Gulisashvili 

Antoine Jacquier 

Josef Teichmann 


Probability Distribution in the SABR Model 
of Stochastic Volatility 


Patrick Hagan, Andrew Lesniewski and Diana Woodward 


Abstract We study the SABR model of stochastic volatility (Wilmott Mag, 2003 
[10]). This model is essentially an extension of the local volatility model (Risk 
7(1):18—20 [4], Risk 7(2):32-39, 1994 [6]), in which a suitable volatility parameter is 
assumed to be stochastic. The SABR model admits a large variety of shapes of volatil- 
ity smiles, and it performs remarkably well in the swaptions and caps/floors markets. 
We refine the results of (Wilmott Mag, 2003 [10]) by constructing an accurate and 
efficient asymptotic form of the probability distribution of forwards. Furthermore, 
we discuss the impact of boundary conditions at zero forward on the volatility smile. 
Our analysis is based on a WKB type expansion for the heat kernel of a perturbed 
Laplace-Beltrami operator on a suitable hyperbolic Riemannian manifold. 


Keywords SABR : Heat kernel expansion + WKB expansion - Implied volatility - 
Asymptotic smile formula 


1 Introduction 


The SABR model [10] of stochastic volatility attempts to capture the dynamics of 
smile in the interest rate derivatives markets which are dominated by caps/floors and 
swaptions. It provides a parsimonious, accurate, intuitive, and easy to implement 
framework for pricing, position management, and relative value in those markets. 
The model describes the dynamics of a single forward (swap or LIBOR) rate with 
stochastic volatility. The dynamics of the model is characterized by a function C (f) 
of the forward rate f which determines the general shape of the volatility skew, a 
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parameter v which controls the level of the volatility of volatility, and a parameter 
p which governs the correlation between the changes in the underlying forward rate 
and its volatility. It is an extension of Black's model: choosing v = Oand C (f) = f 
reduces SABR to the lognormal Black model, while v = 0 and C (f£) = 1 reduces 
it to the normal Black model. 

The main reason why the SABR model has proven effective in the industrial 
setting is that, even though it is too complex to allow for a closed form solution, 
it has an accurate asymptotic solution. This solution, as well as its implications for 
pricing and risk management of interest derivatives, has been described in [10]. 

In this paper we further refine the results presented in [10]. Our developments 
go in two directions. Fist, we present a more systematic framework for generating 
an accurate, asymptotic form of the probability distribution in the SABR model. 
Secondly, we address the issue of low strikes, or the behavior of the model as the 
forward rate approaches zero. 

Our way of thinking has been strongly influenced by the asymptotic techniques 
which go by the names of the geometric optics or the WKB method, and, most 
importantly, by the classical results of Varadhan [19, 20] (see also [13, 18] for 
more recent presentations and refinements). These techniques allow one to relate 
the short time asymptotics of the fundamental solution (or the Green's function) 
of Kolmogorov's equation to the differential geometry of the state space. From the 
probabilistic point of view, the Green's function represents the transition probability 
of the diffusion, and it thus carries all the information about the process. 

Specifically, let М denote the state space of an n-dimensional diffusion process 
with no drift, and let G x (s, x), x, X € М, denote the Green's function. We also 
assume that the process is time homogeneous, meaning that the diffusion matrix is 
independent of s. Then, Varadhan's theorem states that 


а(х, Xy 


lim slog Gx (5, x) = 5 


Here d (x, X) is the geodesic distance on U with respect to a Riemannian metric 
which is determined by the coefficients of the Kolmogorov equation. This gives us 
the leading order behavior of the Green's function. To extract usable asymptotic 
information about the transition probability, more accurate analysis is necessary, but 
the choice of the Riemannian structure оп M dictated by Varadhan's theorem turns out 
to be key. Indeed, that Riemannian geometry becomes an important book keeping 
tool in carrying out the calculations, rather than merely fancy language. Techni- 
cally speaking, we are led to studying the asymptotic properties of the perturbed 
Laplace-Beltrami operator on a Riemannian manifold. 

In order to explain the results of this paper we define a universal function D (C): 


КЕЗ ТЫ: 
E | 


р 


D (С) = log 
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where ¢ is the following combination of today's forward rate f, strike F, and a 
volatility parameter с (which is calibrated so that the at the money options prices 


match the market prices): 
fe v / f du 
m" ғ Cu) 


The function D (C) represents a certain metric whose precise meaning is explained 
in the body of the paper. The key object from the point of view of option pricing is 
the probability distribution of forwards Pr (r, f). Our main result in this paper is 
the explicit asymptotic formula: 


exp (- D (Q? /2rv} 
„Отт oC (Е) (cosh D (C) — p sinh D (¢))*/? 


Pr (т, f) = (Lp ox a 


In order not to burden the notation, we have written down the leading term only; the 
complete formula is stated in Sect. 5. To leading order, the probability distribution of 
forwards in the SABR model is Gaussian with the metric D (С) replacing the usual 
distance. 

From this probability distribution, we can deduce explicit expressions for implied 
volatility. The normal volatility is given by: 


On = OC (Е) (cosh D (C) — p sinh D (O) (1+ -:-). 


Precise formulas, including the subleading terms and the impact of boundary condi- 
tions at zero forward, are stated in Sect. 5. To calculate the corresponding lognormal 
volatility one can use the results of [11]. 

We would like to mention that other stochastic volatility models have been exten- 
sively studied in the literature (notably among them the Heston model [12]). Useful 
presentations of these models are contained in [5, 17]. 

A comment on our style of exposition in this paper. We chose to present the argu- 
ments in an informal manner. In order to make the presentation self-contained, we 
present all the details of calculations, and do not rely on general theorems of differ- 
ential geometry, stochastic calculus, or the theory of partial differential equations. 
And while we believe that all the results of this paper could be stated and proved 
rigorously as theorems, little would be gained and clarity might easily get lost in the 
course of doing so. 

The paper is organized as follows. In Sect.2 we review the model and formulate 
the basic partial differential equation, the backward Kolmogorov equation. We also 
introduce the Green's and discuss various boundary conditions at zero. Section3 is 
devoted to the description of the differential geometry underlying the SABR model. 
We show that the stochastic dynamics defining the model can be viewed as a pertur- 
bation of the Brownian motion on a deformed Poincare plane. The elliptic operator 
in the Kolmogorov equation turns out to be a perturbed Laplace-Beltrami opera- 
tor. This differential geometric setup is key to our asymptotic analysis of the model 
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which is carried through in Sect. 4. In Sect. 5 we derive the explicit formulas for the 
probability distribution and implied volatility which we have discussed above. In 
Appendix А we review the derivation of the fundamental solution of the heat equa- 
tion on the Poincare plane. This solution is the starting point of our perturbation 
expansion. Finally, Appendix B contains some useful asymptotic expansions. 


2 SABR Model 


In this section we describe the SABR model of stochastic volatility [10]. It is a two 
factor model with the dynamics given by a system of two stochastic differential 
equations. The state variables of the model can be thought of as the forward price 
of an asset, and a volatility parameter. In order to derive explicit expressions for the 
associated probability distribution and the implied volatility, we study the Green's 
function of the backward Kolmogorov operator. 


2.1 Underlying Process 


We consider a European option on a forward asset expiring T' years from today. The 
forward asset that we have in mind can be for instance a forward LIBOR rate, a 
forward swap rate, or the forward yield on a bond. The dynamics of the forward in 
the SABR model is given by!: 


dF, = У, С (F) aW,, (1) 
а>, = vXidZ, $ 
Here F; is the forward rate process, and W; and Z; are Brownian motions with 


where the correlation p is assumed constant. We supplement the dynamics (1) with 
the initial condition 


Б = f. 
Xo = с. 


(3) 


! Note that our notation departs somewhat from the notation used in [10]: we use У, instead of о, 
and v; instead of v;. The пате SABR is an acronym for “Stochastic Alpha Beta Rho" which was 
the name of the model originally used at Paribas. 
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Note that we assume that a suitable numeraire has been chosen so that F; is a 
martingale. The process У, is the stochastic component of the volatility of F;, and v 
is the volatility of >, (the *volvol") which is also assumed to be constant. 

The function C (x) is defined for х > 0, and is assumed to be positive, smooth, 
and integrable around 0; 


K du 
/ < oo, forall К > 0. (4) 
o C(u) 


Two examples of C, which are particularly popular among financial practitioners, 
are functions of the form: 


C(x) =x", where0 < 8 <1 (5) 
(stochastic CEV model), or 
C(x) = x +a, where a > 0 (6) 


(stochastic shifted lognormal model). 

Our analysis uses an asymptotic expansion in the parameter 027, and we thus 
require that v?T be small. In practice, this is an excellent assumption for medium and 
longer dated options. Typical for shorter dated options are significant, discontinuous 
movements in implied volatility. The SABR model should presumably be extended 
to include such jump behavior of short dated options. 

The process >; is purely lognormal and thus £; > 0 almost surely. Since, depend- 
ing on the choice of C(x), F, can reach zero with non-zero probability, we should 
take into account the boundary behavior of the process (1), as F; approaches 0. This 
can easily be done in the case of zero correlation between И; and Z;, р = 0. We 
extend the function C (x) to all values of x by setting 


C (—x) = C(x), for x « 0. (7) 


The so extended C (x) is an even function, C (—x) = C(x), for all values of x, and 
thus the process (1) is invariant under the reflection F; — — F;. The state space of 
the extended process is thus the upper half plane. Later on in this paper we shall 
discuss the Dirichlet and Neumann boundary conditions for the SABR model. 

A special case of (1) which will play an important role in our analysis is the case 
of C(x) = 1, and p = 0. In this situation, the basic equations of motion have a 
particularly simple form: 


dF, = Xid Ws, 


8 
ау, = vu,dZ,, ( ) 


with E [dW;d Z;] = 0. We shall refer to this model as the normal SABR model. 
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Local volatility [4, 6], is defined as the conditional expectation value 


ок (T, f.o)) dT = [ағу |F(0 = f, Е = К, £ (0) = о]. (9) 


or, explicitly, 


ок (T, f.oY! = C (KY io |Р (0)= f Е= К, ®(0) = о]. (10) 


Our analysis in the following sections enables us, in particular, to derive an explicit 
expression for ox. 


2.2 Green’s Function 


Green’s functions arise in finance as the prices of Arrow-Debreu securities. 
Equations (1)-(3) correspond to the Arrow-Debreu security whose payoff at time 
T is given by Dirac's delta function 6 (Fr — F, от — X). The time t < T price 
G = Gr,r.x (t, f. о) of this security is the solution to the following parabolic par- 
tial differential equation: 


ôG 1.5 3 G PG дау 


with the terminal condition: 
Gres@, fo) =0(f — F, o — X), at =T. (12) 


This equation should also be supplemented by a boundary condition at infinity such 
that G is financially meaningful. Since the payoff takes place only if the forward 
has a predetermined value in a finite amount of time, the value of the Arrow-Debreu 
security has to tend to zero as F and X become large: 


Gr.r.x (t, f. о) > 0, as Е, У — oo. (13) 
Thus G y. r, x (t, f, о) is a Green's function for (11). Once we have constructed 
it, we can price any European option. For example, the price Cr x (t, f, с) of a 


European call option struck at К and expiring at time Т can be written in terms of 
Gr, rx (t, f, о) as 


Cr.k (t, f. о) = f e- K)* Ст,кх (t, f, o) dFdX, (14) 
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where, as usual, (F — K )* = max (Е — К, 0), and where the integration extends 
over the upper half plane {(F, Spek: E> 0). 

Note that the process (1) is time homogeneous, and thus Gr r.x (t, f, с) isa 
function of the time to expiry т = Т — t only. Denoting 


GF, £ (т, }, о) = Стру (t, }, о), 


and 
Ск (т, f, o) = Ст,к (t, fo). 


we can reformulate (11)-(12) as the initial value problem: 


0G 1 ФС ФС ФС 
—— с)? 2vpC 2 |, 15 
3 757 z( Y бу +25РСОЭЗ атас + =) (15) 
and 
Grx(r.fo)-ó(f—F,o—X),atr = 0. (16) 
Introducing the marginal probability distribution 
оо 
Pr о) = | Ges er о) ах, an 
0 
we can express the call price (14) as 
оо 
Ск (т, f. о) =] (Е — K)* Pr (т, f, 0) dF. (18) 
—oo 


This formula has familiar structure, and one of our main goals will be to derive a 
useful expression for Pr (т, f). 

It is also easy to express the local volatility in terms of the Green's function. 
Indeed, 
C(Ky f? Y? Gx,» (т, fra) dd 


Jo’ Ск,х (т, f.o)dX 


M2 (т, f, 
ox (т, f. e) =C (K) VEL. (20) 


Mx (т, йере |" E Ск, (т, /,о)а® (21) 


OK (T, У, о)? = 


З (19) 


or 


where 


is the conditional second moment. 
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We will solve (15)-(17) by means of asymptotic techniques. In order to set up the 
expansion, it is convenient to introduce the following variables: 


T т x 
s= —, x= f, X =F, у= –, Y = —, 
T v v 


and the rescaled Green’s function: 
Kx y (s, x, y) = vT Gx vy (Ts, x, vy). 


In terms of these variables, the initial value problem (15) and (16) can be recast as: 


ӨК 1 , › К ФК ол) 
92 = ia [соу 5S орск) 22454), 
ðs 23 ( ee Ox? зо) дхду T ду? (22) 


К (0,x,y)=d(«-—X, y—-Y), 


where К = Ky y, and 
є = VT. (23) 


It will be assumed that є is small and it will serve as the parameter of our expansion. 
The heuristic picture behind this idea is that the volatility varies slower than the for- 
ward, and the rates of variability of f and a/v are similar. The time T defines the time 
scale of the problem, and thus s is a natural dimensionless time variable. Expressed in 
terms of the new variables, our problem has a natural differential geometric content 
which is key to its solution. 

Finally, let us write down the equations above for the normal SABR model: 


OK 1 (ж +25) 
= Е $ 
ðs 2^7 Vog ду? 
К (0,х,у)=б(х—-Х,у—Ү). 


(24) 


We will show later that this initial value problem has a closed form solution. 


2.3 Boundary Conditions at Zero Forward 


The problem as we have formulated it so far is not complete. Since the value of the 
forward rate should be positive,” we have to specify a boundary condition for the 
Green's function at x — 0. Three commonly used boundary conditions are [9]: 


?Recent history shows that this is not always necessarily the case, but we regard such occurances 
as anomalous. 
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e Dirichlet (or absorbing) boundary condition. We assume that the Green's function, 
denoted by Key (s, x, y), vanishes at x = 0, 


K? y (s,0, y) =0. Q5) 


e Neumann (or reflecting) boundary condition. We assume that the derivative of 
the Green's function at x — 0, normal to the boundary (and pointing outward), 
vanishes. Let K y y ($, x, y) denote this Green's function; then 


д 
эу Kx 6,0.) =0. (26) 
e Robin (or mixed) boundary condition. The Green's function, which we shall denote 
by K m (s, x, y), satisfies the following condition. Given 7 > 0, 


д 
(- a n) КЕ y (5,0, y) = 0. (27) 


From the financial point of view, the relevant boundary conditions are the Dirichlet 
and Neumann conditions. It is well known that the Green's functions corresponding 
to these different boundary conditions obey the following conditioning inequalities: 


К? <к < К“. (28) 


Since the Dirichlet boundary condition corresponds to the stochastic process being 
killed at the boundary, the total mass of the Green's function is less than one: 


] к?» (s, x, y) dx dy « 1. (29) 


The remaining probability is a Dirac's delta function at x — 0. On the other hand, 
for the free and Neumann boundary conditions, 


J Kartz yazdy = | кў б,х,у)ахау =1, (30) 


апа so they are bona fide probability distributions. 

Our method allows for deriving explicit expressions for the Green's functions in 
the case of zero correlation. In this case, the differential operator in (22) is invariant 
under a #2 group action given by the reflection x — —x of the upper half plane. 
This allows us to construct the desired Green's functions by means of the method 
of images. Namely, let K x y (s, x, y) denote now the solution to (22) with C(x) 
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extended to the entire upper half plane, as explained in Sect. 2.1.2 Then, one verifies 
readily that 
Ky y ($; x, y) = Kx y ($, x, y) — Ку ү ($, —x, у), (31) 


апа 
Ky y (s, x, y) = Kx,v (s, x, y) + Kxy ($, —x, y) (32) 


are the solutions to the Dirichlet and Neumann problem, respectively. 


2.4 Solving the Initial Value Problem 


It is easy to write down a formal solution to the initial value problem (22). Let L 
denote the partial differential operator 


1 д? д? д? 
L=- y |C + 2р (х) —— + = 33 
2? ( (x) 3x2 TPP ӨЛ ду ду? (33) 
supplemented by a suitable boundary condition at x = 0. Consider the one-parameter 
semigroup of operators 
U (s) = exp(seL). (34) 


Then U solves the following initial value problem: 


CNET 
as 
U (0) = I, 


and thus the Green's function Ky y (s, x, y) is the integral kernel of U (s): 
Kx,y (s, x, y) = U (s) (х, y; X. Y). (35) 
In order to solve the problem (22) it is thus sufficient to construct the semigroup 
U (s) and find its integral kernel. Keeping in mind that our goal is to find an explicit 
formula for Kx y (s, x, y), the strategy will be to represent L as the sum 
L = Lo+ V, (36) 


where Lo is a second order differential operator with the property that 


Uo (s) = exp (seLo) (37) 


3This solution ignores any boundary condition at x = 0 and is sometimes referred to as the Green's 
function with a free boundary condition. 
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can be represented in closed form. Specifically, we will proceed in several steps. We 
start with the normal SABR model defined in Sect. 2.1, and notice that the corre- 
sponding operator L is a well known object, namely the generator of the Brownian 
motion on the upper half-plane. The integral kernel of the semigroup U (5) generated 
by this operator can be represented as an explicit integral over the real axis. Next we 
observe that the general SABR model can naturally be mapped on the normal SABR 
model by means of a suitable diffeomorphism ф. We find that, under this mapping, 
the operator L is the sum of two parts: (1) the pullback of the generator of the Brown- 
ian motion on the upper half-plane, denoted by Lo, and (ii) a perturbation V. The 
kernel of the semigroup generated by Lo has an explicit integral representation. The 
operator V turns out to be a differential operator of first order, and we will treat it as 
a small perturbation of the operator Lo. 
The semigroup U (5) can now be expressed in terms of Ug (s) and V as 


U (s) = О (s) Uo (5). (38) 


Here, the operator Q (s) is given by the well known regular perturbation expansion: 


О (5) = 1+ Y e51®dLo (у)... еба) (V) ds, ...dsn, (39) 


51...53 LSE 
]xn«oo " 5312525 


where adz, is the commutator with Lo: 
adr, (V) = LoV — V Lo. (40) 


We will use the first few terms in the expansion above in order to construct an 
accurate approximation to the Green's function K x y (s, x, y): 


Q(s) = IJ seV + ; (se)? (aaz, (V) + v?) +0 (29 | (41) 


We shall disregard the convergence issues associated with this series, and use it solely 
as a tool to generate an asymptotic expansion. 


3 Stochastic Geometry of the State Space 


In solving our model we find that the normal SABR model represents Brownian 
motion on the Poincare plane. Generally, when р = 0, or C(x) 4 1, the model 
amounts to Brownian motion on a two dimensional manifold, the SABR plane, per- 
turbed by a drift term. In this section we summarize a number of basic facts about 
the differential geometry of the state space of the SABR model. The fundamental 
geometric structure is that of the Poincare plane. We will show that the state space of 
the SABR model can be viewed as a suitable deformation of the Poincare geometry. 
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3.1 SABR Plane 


We begin by reviewing the Poincare geometry of the upper half plane which will 
serve as the standard state space of our model. For a full (and very readable) account 
of the theory the reader is referred to e.g. [1]. 

The Poincare plane (also known as the hyperbolic or Lobachevski plane) is the 
upper half plane Н? = ((x, у): y > 0} equipped with the Poincare line element 


2 dx? + dy? 


ds 
y? 


(42) 


This line element comes from the metric tensor given by 


1 
= (0 ae (43) 


The Poincare plane admits a large group of symmetries. We introduce complex 
coordinates on Н?, z = x + iy (the defining condition then reads Imz > 0), and 
consider a Moebius transformation 


az+b 
к= aa (44) 


where a, b, c, d are real numbers with ad — bc = 1. We verify easily the following 
two facts. 


e Transformation (44) is a biholomorphic map of H? onto itself. 
e The Poincare metric is invariant under (44). 


As a consequence, the Lie group 


ою = |(2 ji ebed eR, ad — bc -1 (45) 


acts holomorphically and isometrically on H*. This symmetry group plays very 
much the same role in the hyperbolic geometry as the Euclidean group in the usual 
Euclidean geometry of the plane R2. 

In order to study the SABR model with the Dirichlet or Neumann boundary 
conditions at zero forward, we define the following reflection 0 : Н? — HÊ: 


(х, у) = (~x, у) (46) 
(clearly, this is a reflection with respect to the y-axis). The key fact about 0 is that it 


is an involution, i.e. 
0 o0 (2) =z. (47) 
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One can also write 0 as 0 (z) — —z, which shows that it is an anti-holomorphic map 
of H? into itself. It is easy to find the set of fixed points of 0, namely the points on 
the Poincare plane which are left invariant by б: 


0(x,y)—-(x,y)ex--O. (48) 


i.e. it is the positive y-axis. 

Let d (z, Z) denote the geodesic distance between two points z, Z € H?,z= 
x+iy,Z = X + iY, i.e. the length of the shortest path connecting z and Z. There 
is an explicit expression for d (z, Z): 


coshd (z, Z) = 1 + z-z? (49) 
Z, = 2yY , 


where |z — Z| denotes the Euclidean distance between z and Z. In particular, if 
x = X, then d (z, Z) = |log (y/Y)|. We also note that the reflection 0 is an isometry 
with respect to this metric, d (0 (2), 0 (Z)) = d (z, Z). 

We also note that since det (h) = у“, the invariant volume element on Н? is 
given by 


dun (z) = y det (h) dx dy 
ах ау (50) 


у? ' 


The state space associated with the general SABR model has a somewhat more 
complicated geometry. Let S? denote the upper half plane ((x, y) : у > 0}, equipped 
with the following metric g: 


1 1 —pC(x) ) 
= — А 51 
= ууу? („сез соу ом 
This metric is a generalization of the Poincare metric: the case of o = 0 and C(x) = 1 


reduces to the Poincare metric. In fact, the metric g is the pullback of the Poincare 
metric under a suitable diffeomorphism. To see this, we define a map ¢ : S? — Н? by 


dis == ([ z ) (52) 
о Cw Jy 


where z = (x, y). The Jacobian V ó of ¢ is 


1 р 
Уф (2) = ( E. C(x) pe ) І (53) 
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and so ¢*h = g, where ф* denotes the pullback of ó. The manifold S? is thus 
isometrically diffeomorphic with the Poincare plane. A consequence of this fact is 
that we have an explicit formula for the geodesic distance ô (z, Z) on 52: 


cosh ô (z, Z) = coshd (¢ (z) , 6 (Z)) 


x du a 2 Y X du Y 2 
T Sz С(и) ply ) fx Сау tO ) 
2 (1 — p?) yY 


, (54) 


where z = (x, y) and = (X, Y) аге мо points on S?. Since det (g) — Ca), 
the invariant volume element on S? is given by 


dug (z) = y det (g) dx dy 
_ ахау (55) 
2 С(х)у?` 


In the case of p = 0, the manifold S? carries an isometric reflection @ which 
commutes with (52): 


боф (2) = фо (х), (56) 


i.e. 0 is inherited from the corresponding reflection 0 of the Poincare plane. Explicitly, 
0 (x, y) = (—x, y). Strictly speaking, this holds only holds if x 4 0, as the metric 
(51) explodes at the boundary x = 0. 


3.2 Brownian Motion on the SABR Plane 


It is no coincidence that the SABR model leads to the Poincare geometry. Indeed, 
the dynamics of the normal SABR model is given by the Brownian motion on the 
Poincare plane. In this section we shall establish this relationship, and use it in 
Sect. 3.3 in order to find an explicit representation of the integral kernel of (37). 

Recall [13] that the Brownian motion on the Poincare plane is described by the 
following system of stochastic differential equations: 


dX; = Y,dW,, 


(57) 
dY; = YidZi;, 


with the two Wiener processes W, and Z, satisfying 


E [40,42] = 0. (58) 
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Comparing this with the special case of the normal SABR model (8), we see that (8) 
reduces to (57) once we have made the following identifications: 


Х, = 


021, 


1 (59) 
Ү, = " 2j 


and used the scaling properties of a Wiener process: 


dWy, = vaW,, 
dZy, = vdZ,. 


Note that the system (57) can easily be solved in closed form: its solution is 
given by 


t 2 
х= Хо+®% | ер (7) — 5 )aw o. 
0 


t2 
Y, = new (2 = 5): 


(60) 


Let us now compare ће SABR dynamics with that of the diffusion оп ће SABR 
plane. In order to find the dynamics of Brownian motion on the SABR plane we use 
the fact that there is a mapping (namely, (52)) of S? into H2. Using this mapping and 
Ito's lemma yields the following system 


1 
ах, = 2 Y?C (X) С' (X dt + Y;,C (X) d Wiss (61) 


dY, — Yid Z, , 
with the two Wiener processes W, апа Z, satisfying 
E [d W;d Z;] = pdt. (62) 
Note that this is not exactly the SABR model dynamics. Indeed, one can regard the 
SABR model as the perturbation of the Brownian motion on the SABR plane by the 
drift term — 5 Y2C (X;) C' (Х,) dt. 


As in the case of the Poincare plane, it is possible to represent the solution to the 
system (61) explicitly: 


X; du t 52 
[a Ca) = r f exp (z - =) awe), 


12 
Y, = Yo exp (z = 5) à 


(63) 
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Parenthetically, we note that, within Stratonovich's calculus, (61) can be written as 


ах, = Ү,С(Х,) odW,, 
dY, = Ү, odZ;. 


Therefore, the stochastic differential equations of the SABR model, if interpreted 
according to Stratonovich, describe the dynamics of Brownian motion on the SABR 
plane. 


3.3 Laplace-Beltrami Operator on the SABR Plane 


It will be convenient to use invariant notation. Let z! = x, z? = y, and let 
д„ = O/Oz", и = 1, 2, denote the corresponding partial derivatives. We denote 
the components of g^! by g"", and use g^! and g to raise and lower the indices: 
Zu = Juz”, OF = 9 д, = O/Oz,, where we sum over the repeated indices. 
Explicitly, 


9! = y! (COA + pCO), 
8 = у? (оС(х)д + д›). 


Consequently, the initial value problem (22) can be written in the following geometric 
form: 


д 1 
— = — H 
3s Kz (5, 2) zE? O,Kz (5, 2), 


Ку (0, :) = 6 (:- Z), 


(64) 


where ó(z — Z) = d(x — X, у — Y) denotes the two-dimensional Dirac's delta 
function. 

Recall that the Laplace-Beltrami operator A, on a Riemannian manifold M with 
metric tensor g is defined by 


| 1 д 
© /detg Ox" 


where f is a smooth function on M. It is a natural generalization of the familiar 
Laplace operator to spaces with non-Euclidean geometry. Its importance for prob- 
ability theory comes from the fact that it serves as the infinitesimal generator of 
Brownian motion on such spaces (see e.g. [7, 8, 13]). 


Ag f Өх” 


Кт gn at) | (65) 
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In the case of the Poincare plane, the Laplace-Beltrami operator has the form: 


o? a 
2 
An=y (+5). (66) 


As anticipated by our discussion in Sect. 3.2, this operator is closely related to the 
operator L in the normal SABR model. In fact, in this case, 


1 
Бе Ак (67) 
and thus the problem (24) turns out to be the initial value problem the heat equation 
on H?: 
OKz 1 Ak 
a oe (68) 
Kz (0,2) = ô (z — Z). 


The key fact is that the Green’s function for this equation can be represented in closed 
form, 


e 58/8 2 oo RR. 4 
(2Олє)?/? Y2 Јак) J/coshu — coshd (с, Z) 


ie A= u. (69) 


This formula was originally derived by McKean [16] (see also [13] and references 
therein). We have added the superscript Л to indicate that this Green's function 
is associated with the Poincare metric. In Appendix A we outline an elementary 
derivation of this fact. 

Let us now extend the discussion above to the general case. We note first that, 
except for the case of C(x) = 1, the operator O"O,, does not coincide with the 
Laplace-Beltrami operator A, on S? associated with the metric (51). It is, however, 
easy to verify that 


1 ð 8f 
H 2, р == 22. 2а Hv). oe 
еа = OP ( E) jos 
= 1 2 y Of 
d ек Ox 
and thus 
1 1 д 
L=- A; ^C = 
2% эйс" ees 
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where Lo is essentially the Laplace-Beltrami operator: 


1 
Lo — ^з , (70) 
and V (x) is lower order: 
1 д 
ү=————— CWC (х) —. (71) 
ле” Әх 


Let us first focus on the Laplace-Beltrami operator Ag. The key property of 
the Laplace-Beltrami operator is that it commutes with isometries of Riemannian 
manifolds. In particular, this implies that 


фо Ag = An 0 Q, (72) 


and, thus the Laplace-Beltrami operator A, is the pullback of A, under 0. As a 
consequence, the heat equation 


OK 1 Auk 
m a I 


on S? can be solved in closed form! The Green's function K p (s, z) of this equation 
is related to (69) by 


КЎ (s, 2) = det (Уф (Z)) Ks (s, $ (2)). (73) 
Explicitly, 
—se/8 2 oo =u? /2se 
КЎ (s,2) = у / тайи. 09 
(275=)3/2 J1 — p2 Y2C (X) Jo cosh u — cosh ó 


where 6 = 6 (z, Z) is the geodesic distance (54) on S?. This is the explicit represen- 
tation of the integral kernel of the operator Up (s). 


4 Asymptotic Expansion 


In principle, we have now completed our task of solving the initial value problem 
(24). Indeed, its solution is given by 


Kz (s, z) = Q (s) K3 (s, 2), (75) 


Probability Distribution in the SABR Model of Stochastic Volatility 19 


where Q (s) is the perturbation expansion given by (39). In order to produce clear 
results that can readily be used in practice we perform now a perturbation expansion 
on the expression above. Our method allows one to calculate the Green's function of 
the model to the desired order of accuracy. 

Let us start with the Green's function K » (s, z) which is defined on the Poincare 
plane. In Appendix B we derived an asymptotic expansion (117) for the heat kernel 
on the Poincare plane. After rescaling as in (106), we arrive at 


1 2 
К2 (8,2) = y exp ( 5) i 


d 1 /dcothd — 1 
x 1 1) \4+0()7)), 
sinh d ( 8 ( d? ы ) Ы ( )) 


where we have introduced a new variable, 


А se. (76) 


We can now extend the expression to the general Green's function K 2 (s, 2). Using 
(73) or (74) we find that K А (s, 2) has the following asymptotic expansion: 


КЎ (8,2) = 


1 x) 
ex x 
21A 1 — p2¥2C (X) e( 2A 
| ô 1 fócothó— 1 2 
xx; (1 1l o? +1)л+о( )). 


To complete the calculation in the case of general C(x) we need to take into 
account the contribution to the Green’s function coming from perturbation V defined 
in (71). Let us define the function: 


а (z, Z) = sinh (2, Z)V6(z, Z) 
____уС@) _ (f. du — -») 77 
-Arr Vx Си) ^0 DJ. ©? 


From (117) and (118), 


Kz (s, z) = (I + AV) К? (5, z) 


(ke. 4 5 Ке ® ©). (78) 


sinh ô 06 


1 
© л —р2Ү?С(Х) 
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which yields the following asymptotic formula for the Green's function: 


Kz (5,2) = 


2 
! exp ( 2 ) 
24A 1 — p Y2C (X) 2А 


| ô 1 ô 
^ sinh ó sinh б 4 


1  ócothó—1  3(1—ócothó) + 52 
9 
G+ 802 86 sinh б a) ло (х) M) 


In a way, this is the central result of this paper. It gives us a precise asymptotic 
behavior of the Green's function of the SABR model, as А — 0. 


5 Volatility Smile 


We are now ready to complete our analysis. Given the explicit form of the approximate 
Green’s function, we can calculate (via another asymptotic expansion) the marginal 
probability distribution. Comparing the result with the normal probability distribution 
allows us to find the implied normal and lognormal volatilities, as functions of the 
model parameters. We conclude this section by deriving explicit formulas for the 
case of the CEV model C (x) = x? and the shifted lognormal model C (x) = x +a. 


5.1 Marginal Transition Probability 


First, we integrate the asymptotic joint density over the terminal volatility variable 
Y to find the marginal density for the forward x. To within O (2?) Я 


оо 
Px (s, x, y) E Kz(s,z)dY 
0 


1 99 5 ó ô 
— n e 9 1 " 
2nA 1 — p? C (X) Jo sinh ó sinh ó 


= _ 2 
-&^(1+ => 1 3(1—dcothd)+6 )| = 


ó? sinh ó ү?` 80) 


Here the metric ô (z, Z) is defined implicitly by (54). We evaluate this integral asymp- 


totically by using Laplace's method (steepest descent). This analysis is carried out 
in Appendix B.2. The key step is to analyze the argument Y of the exponent 


1 
Ф@)= 5 0 z, (81) 
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in order to find the point Yo where this function is at a minumum. Let us introduce 


the notation: 
С= Jii du 
yJx CQ) 


Since yC(u) is basically the rescaled volatility at forward и, 1/6 represents the 
average volatility between today's forward x and at option's strike X. In other words, 
G represents how "easy" it is to reach the strike X. Some algebra shows that the 
minimum of (81) occurs at Yo = Yo (С, y), where 


Yo = уу — 206 +1. (82) 


The meaning of Yọ is clear: it is the “most likely value" of Y, and thus YoC (X) (when 
expressed in the original units) should be the leading contribution to the observed 
implied volatility. Also, let D (C) denote the value of (2, 7) with Y = Yo. Explicitly, 


Мад EC 
Ж | 


D (0 = log (83) 


The analysis in Appendix B.3 shows that the probability distribution for x is Gaussian 
in this minimum distance, at least to leading order. Specifically, it is shown there that 
to within O (2?) Я 


ОЕ. ' D^|], , »c'eD 
= ex 
ae УО КЫРЕ | од А21 


1 yC (х) р 6pyC' (x) 
——А|1+ 
8 | 2/1-pg1 /1— 212 
3(-9) | 3уС'0) 6- &)D sinh (D) doce 
I 2/1- gir D 


cosh (D) (84) 


where 


IQ = м2 – 2р6 +1 


= cosh D (C) — p sinh D (С). (85) 
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As this expression may be useful on its own, we rewrite it in terms of the original 
variables: 


1 1 р? C'(f)D 
Pr (T, f,o) = exp | 2605. 


„Отт oC (F) DP? 2vy/1— p? I 
1 : D 4 
E 42€ (0) 6poC' (f) 


cosh (D) 


2u/1—p I vyl- er 


3(1— 02)  3eC'(f) (5 – p) D \ sinh (D) 
+ Ter. (86 
( 1 2w/1— p? I? D ii 


where we have slightly abused the notation. This is the desired asymptotic form of 
the marginal probability distribution. 


5.2 Implied Volatility 


The normal implied volatility is given by Sect. 2.2, and we are thus left with the task 
of calculating the conditional second moment. Explicitly, 


оо 
м? бх) | үк; (s, 2 dY 
0 


1 9 og д б 
f e? /2X 1 4 
2nA 1 — p? C (X) Jo sinh ó sinh ó 


1 ócothó —1  3(1—ócothó) + 52 
-^(1+ 52 ТТ a) ar. (87) 


In Appendix B.3 we show that 


M; (s,x,y)= 


E ut a 
nA yC (X) P 2А 2/1— pI 
ET ; yC' (x) D n 2pyC' (x) 
8 2/1- P1 Ji-pPP 
— д? l = 1 
Е g^) | 2УС' (x) 3p 92 een... 
I /1 — p 12 D 


cosh (D) 
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Despite their complicated appearances, the two expressions have a lot in common, 
and their ratio has a rather simple form. After the dust settles, we find that 


ок (т, f, e = o?C (f 1 (Q) 
elia 20С' (f) (p cosh (D) — sinh (D)) Ir 
oC! (f) DI -2/1-— p? 2v 


or 


ox (т, о) 2 oC (f) I (Q 
oC’ (f) (pcosh (D) — sinh (D)) 5, 
1 eb 89 
| Ы oC! (f) DI + 24/1 р2 120 КА | "7 


This is a refinement of the original asymptotic expression for implied volatility in 
the SABR model. 

It is easy to apply this formula to the specific choice of the function C (f). In case 
of the stochastic СЕУ model, C (f) = f?, with 0 < 8 < 1. If 8 = 1, then 


T. f 
c= log (2). (90) 
For 0 « 6 < 1, 
ЕС Гк = Fi? 
p (91) 


In the shifted lognormal model, C (f£) = f + a, where a > 0. Consequently, 


ov fta 
[= = ve ( 4+). (92) 


5.3 Implied Volatility at Low Strikes 


Our analysis so far has been base on the assumption that we were boundary conditions 
at zero forward. In the case of p = 0, we can tackle the Dirichlet and Neumann 
boundary conditions explicitly. 

As explained in Sect. 2.3, the Green’s functions corresponding to the Dirichlet and 
Neumann boundary conditions at zero forward can easily be calculated, using the 
method of images, in terms of the Green’s function with free boundary conditions. 
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This, in turn, allows us to express the marginal probability distributions in terms 
of (86): 


pps (T, f. с) = РЕ (T, f с) = РЕ (т, =f, с), 


Neumann (93) 
РЕ (т, /,о)= Pr (т, o) + Pr (T, — fa o). 
Analogous formulas hold for the conditional second moments. We can now easily find 
asymptotic expressions for the implied volatilities corresponding to these boundary 
conditions. 
In order to keep the appearance of the otherwise unwieldy formulas reasonable, 
we shall introduce some additional notation. Let 


rar), (94) 
where "m 
0 1 / * du 
=- ; 95 
‘ y Јх Cu) m 


Furthermore, let us define the ratio 


I 
= з (96) 


and note that ^; « 1. Finally, we set: 


—1, for the Dirichlet boundary condition, 
n= 0, for the free boundary condition. (97) 


1, for the Neumann boundary condition. 


It is now easy to see that: 


7 1 
cC'(f)(pcosh (D) — sinh (D)) > 
| oC! (f) DI -2/1— р2 Iu P: | 


Itis worthwhile to note that for large strikes all three of these quantities are practically 
equal, and one might as well work with the free boundary condition expression. 


Indeed, in this case, y ^ 0, and so y 1 — ny + 1272 © 1. Also, we see from this 
expression that, at least asymptotically, 


(98) 


oDirichlet (т f g) < gt (т, fed) < Romam (т, f, o), (99) 
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This result is intuitively clear, and (98) quantifies it in a way that can be used for 
position management purposes. The decision which boundary condition to adopt 
should be made based on specific market conditions. 


Appendix A Heat Equation on the Poincare Plane 


In this appendix we present an elementary derivation of the explicit representation 
of the Green's function for the heat equation on Н2. This explicit formula has been 
known for a long time (see e.g. [16]), and we include its construction here in order 
to make our calculations self-contained. 


A.1 Lower Bound on the Laplace-Beltrami Operator 


We shall first establish a lower bound on the spectrum of the Laplace-Beltrami 
operator on the Poincare plane. Let H = L? (H°, d Ln) denote the Hilbert space of 
complex functions on Н? which are square integrable with respect to the measure 
(50). The inner product on this space is thus given by: 


ew - [ $972. (100) 
Н? y 


Itis easy to verify that the Laplace-Beltrami operator A; is self-adjoint with respect 
to this inner product. 
Consider now the first order differential operator О on H defined by 


ð 1 д 
o= bp- oi (101) 


Its hermitian adjoint with respect to (100) is 


" д 1 д 
Q' =b- o% (102) 


> (оо? + о'о) =-^ - 4. (103) 
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This implies that 


1 Р 1 1 
(| - 449) = „(Ф00!Ф) + „(Ф10'0Ф) +7 cero) 
= (0'Ф0'Ф) + (09109) + 1 (elo) 
2 2 4 
1 
> 20010), 


where we have used the fact that (v|W) > 0, for all functions V є H. Asa 
consequence, we have established that the spectrum of the operator — A; is bounded 
from below by i! This fact was first proved in [16]. 


A.2 Construction of the Green’s Function 
Let us now consider the the following initial value problem: 


д 
— С ,2)— AnG "m 
As z (5, 2) n Gz (5, 2) (104) 
Gz (0, z) = Y?ó (z 7), 
where z, Z € HÊ. In addition, we require that 


Gz(s,z)— 0, asd (2, 4) — oo. (105) 


Note that, up to the factor of Y? in front of the delta function and a trivial time 
rescaling, this is exactly the initial value problem (68): 


Gz (s, z) = Y?Kz (2s/e, z). (106) 
The Green's function Gz (s, z) is also referred to as the heat kernel^ on H?. The 
reason for inserting the factor of Y? in front of б (< — Z) is that the distribution 


Y?6 (z — Z) is invariant under the action (44) of the Lie group SL (2, IR). In fact, we 
verify readily that 


Ү?б(@—7)= 1 ô (coshd (z, Z) — 1). 


^Tt is the integral kernel of the semigroup of operators generated by the heat equation. 
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Now, since the initial value problem (105) is invariant under 50 (2, R), its solution 


must be invariant and thus a function of d (z, Z) only. Let r = coshd (z, Z), and 
write Gz (s, z) = v (s, г). Then the heat equation in (105) takes the form 


д 2 д? д 
gren (r — 1) пата er). (107) 


We have established above that the operator — А), is self-adjoint on the Hilbert space 
H, and its spectrum is bounded from below by 1 . Therefore, we shall seek the solution 
as the Laplace transform 


y(s,r) = 7 e AL (A,r) da (108) 
1/4 


which yields the following ordinary differential equation: 


(1 2) d о) heart poi medic ey cu (109) 
=?) r) — 2r— г) — г) = 0. 
dr? c dr ^C | 

We write 
A-—-—v(v-cl, 
where 
1 
y—-—-—-ci ES 
2 V^ 74 
1, 
= ——+iw, 
2 


and recognize in (109) the Legendre equation. Note that, as a consequence of the 
inequality А > l wis real and Rev = -j. 

In the remainder of this appendix, we will use the well known properties of the 
solutions to the Legendre equation, and follow Chaps. 7 and 8 of Lebedev's book on 
special functions [15]. The general solution to (109) is a linear combination of the 
Legendre functions of the first and second kinds, P—1/2+iw (r) and Q-—1/2+iw (r), 
respectively: 


1 
L (G ie", ) = Ay P12tiw r) + Bu Q-1/2+iw (r). (110) 


As d — 0 (which is equivalent to r — 1), 


Q-1/2+iw (cosh d) ~ const logd, (111) 
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which would imply that ọ (s, cosh d) is singular at d = 0, for all values of s > 0. 
Since this is impossible, we conclude that B,, — 0. Note that, on the other hand, 


P_i/2+iw (1) = 1, (112) 


i.e. P_1/2+iw (cosh d) is non-singular at d = 0. We will now invoke the Mehler-Fock 
transformation of a function?: 


Ў) = А Ў) Piprtin (7) dr, (113) 
f= f fw) P_1/2+iw (r) w tanh (тш) dw. (114) 
0 


In particular, (112) implies that the Mehler-Fock transform of 6 (r — 1) is 1, and thus 
(remember that we need to divide ô (r — 1) by т): 


Ay = de tanh (тш). 
2T 
Note that this relation can be viewed as a spectral representation for the unbounded 
self-adjoint Laplace-Beltrami operator on the Poincare plane. 
Now, the Legendre function of the first kind P-1/2+iw (r) has the following 
integral representation: 


„2, 2 sin (wu) 
P 424i, (cosh d) = — coth (тш) du, (115) 
T d  Xcoshu — cosh d 


which is valid for all real w. Therefore 


1 2 1 99 sin (wu) 
L{—+.0%, coshd | = du, 
4 V272 Ја J/coshu — cosh d 
and we can easily carry out the integration in (108) to obtain 


e I 2 oo PENAS 
P TEES d 
(Ans)? Јака.) /coshu — coshd (z, Z) 


Gz(s,z)= и. (116) 


This is McKean's closed form representation of the Green's function of the heat 
equation on the Poincare plane [16]. 
Going back to the original normalization conventions of (68) yields formula (69). 


3Strictly speaking, we will deal with distributions rather than functions. A rigor oriented reader can 
easily recast the following calculations into respectable mathematics. 
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Appendix B Some Asymptotic Expansions 


In this appendix we collect a number of asymptotic expansions used in this paper. 


B.1 Asymptotics of the McKean Kernel 


We shall first establish a short time asymptotic expansion of McKean's kernel. This 
expansion plays a key role in the analysis of the Green's function of the SABR model. 
In the right hand side of (116) we substitute и = /4sw + d?: 


e I2 адз оо e^" dw 
—————— e t 
413/2 fs 0 Vcosh J4sw + d? — cosh d 


Expanding the integrand in powers of s yields 


Gz (5,2) = 


1 | d 
> = x 
Vcosh /4sw + d? — cosh d sinh d 


1 dcothd — 1 
V2: 0 [s^ i. 
(== 4d? sw o (227) 


Integrating term by term over w we find that 


e 5/4 d 
Gz(s,z)= —— 
z (5,2) 475 TR ( =) * 


d | 1 dcothd — 1 o (s) 
Y sinhd 4 p PES 


and we thus obtain the following asymptotic expansion of the McKean kernel: 


Gz (5,0 1 d? 
5,2) = —— expl —— | x 
99 475 P 4s 


qe dy ышт +0 (s*) (117) 
sinhd 4 d и IE 
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Taking the derivative of Gz (s, z) with respect of d (z, Z) in the expansion above, 
we find that 


D oui ME: d? 
dd тт E x 


d d а 1 — d coth d 
—— + {1 — : 11 
"ril = +5 ( +3 Z )+0%] (118) 


B.2 Laplace’s Method 


Next we review the Laplace method (see e.g. [2, 3]) which allows one to evaluate 
approximately integrals of the form: 


ra f We POF du. (119) 
0 


We use this method in order to evaluate the marginal probability distribution for the 
Green’s function. 

In the integral (119), є is a small parameter, and f (u) and ф (и) are smooth 
functions on the interval [0, oo).Ó We also assume that ¢ (и) has a unique minimum 
ио inside the interval with Ф” (ug) > 0. The idea is that, as є — 0, the value of the 
integral is dominated by the quadratic approximation to ¢ (и) around ио. 

More precisely, we have the following asymptotic expansion. As e — 0, 


^. f (u)e 9 €/e аи = _2лє_ е—®(ио)/е у 
б o" (uo) 


| ТИ | /” (uo) 6 (uo) f (uo) 


2" (uo) 8" (ио)? 


/ (3) (3) 2 
f! (uo) Ф (uo) " Soh" (uo) ш +0 (2) . — (120) 
2ф" (uo) 24" (uo) 


To generate this expansion, we first expand f (и) and ¢ (и) in Taylor series around uo 
to orders 2 and 4, respectively (keep in mind that the first order term in the expansion 
of Ф (и) is zero). Then, expanding the regular terms in the exponential, we organize 
the integrand as e- 9 Qo) (uuo /2 times а polynomial in e. In the limit e — 0, the 
integral reduces to calculating moments of the Gaussian measure; the result is (120). 
It is straightforward to compute terms of order higher than 1 in e, even though the 
calculations become increasingly complex as the order increases. 


бү can be an arbitrary interval. 
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Finally, let us state a slight generalization of (120), which we use below. In the 
integral (119), we replace f (и) by f (и) + eg (и). Then, as e — 0, 


27€ 


Га Lf (и) + eg (и)]е®)/* аи = d (uo) g PUD/E x 
f” (uo) Ф (uo) f (uo) 
|. ee [ iu 29" (uo) 8" (uo)? 


/ (3) (3) 2 
f' (uo) Ф ш 2s 59" (uo) иШ " o (2) 

20" (uo) 240" (uo) 
(121) 


This formula follows immediately form (120). 


B.3 Application of Laplace's Method 


We shall now apply formula (121) to evaluate the integrals (80) and (87). Each of 
these integrals is of the form given by the right hand side of (121). We find easily 
that the minimum Yọ of the function 


o(Y) = : ô (z, 7)? 


is given by 


Yo = yV? — 2C +1, 


С= Ji du 
С yJx CQ) 
Also, we let D (C) denote the value of 6 (2, Z) with Y = Yo: 


JO CET Cop 
с | 


p 


where 


D (С) = log 


and 


1(Q = VQ - 296 4-1. 
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Finally, we note that the second derivative à" (Yo) of Ф (Y) with respect to Y is 


D 
(1 — 2) y? I sinh D’ 


o" (Yo) = 


where we have suppressed the argument ¢ in D (C) and 7 (C). Likewise, 


3D 
(1 — р?) 3? sinh D’ 


99 (Yo) = 


and 
3 (1 — D coth D) 12D 


@ (yp) = | 
pr” (Yo) (1 — PP n sinh? D  (1—)y^D sinh D 


It is actually easier to begin the calculation with (87). In order to evaluate the 
various terms on the right hand side of (121), let us define 


tease Dass) 
а sinh ó sinh ó 4 ] 


(р) | à Lj dcothd ~ 1 3 (1 — dcoth 8) + 0? 
I V sinh V8 852 Sósinhó 4: 
Then, after some manipulations we find that: 


_ [р yC' (x) D 
fd = sinh D (+; 2) 


D ) C' (x) (sinh (D) — pcosh (D)) 
sinh D 2 (1 _ p^ I? 


and 


f (бу) = ( 


А D 1— рсоћ D 3yC' (x) D 
f (0) = : 21.2 : 1+ 
sinh D 2 (1— p?) y?I D sinh D 2/1— p I 


( D ) C' (x) (sinh (D) — pcosh (D)) 
sinh D (1 RE pyr y 


, 
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and 


1 D 1 — Dcoth D 3(1— Dcoth D) + D? 
g (Yo) = e 1 + УуС' (x) . 
8 V sinh D D 2/1— р2 ID 


Putting all these together we find that 


5 il Eu 3C 
My бух, ) = = 96 00 Te] +2292 


À 

/ D 2 / 
| УС (x) РА pyC (x) зањр) 
1 


1 
+—А 
8 | 2/1-pg1 /1— 212 
( 


3(1—p?) | 2yC' (x) (302 — 4) D Y sinh (D) " 
« I  fi-pP p |?) 


as claimed in Sect. 5. 

Let us now compute (80). We note that the functions f and g in (121) occurring in 
this integral are obtained from the corresponding functions in (80) by dividing them 
by Y?. We thus define 


~ Y 
foy - £0, 
and (Y) 
g (Y) = —— 
Then, 
= f (Qo) 
f (Yo) = yn' 
2 2f(Yo F o) 
f Yo) = РЫ + ap | 
=y 6f (Yo) 47 Yo , f"(Yo 
f = yn yn Еа yn" 
and Ya 
000) = 222 


ynD' 
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Combining all the terms we find that 


Рх (5, х, у) = 


1 1 р? jns yC'(x) D 
ex 
Jami УС OO BR. PP | 723 


1 "(хур / 
ЬО ДК 


8 2/1—p 1 ү 212 
3(1—p*) ЗуС' (x) (5 – р?) D \ sinh (D) 5 
( Т + SRR ——|+0()}, 


as stated in Sect. 5. 
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Asymptotic Implied Volatility at the Second 
Order with Application to the SABR Model 


Louis Paulot 


Abstract We provide a general method to compute a Taylor expansion in time of 
implied volatility for stochastic volatility models, using a heat kernel expansion. 
Beyond the order 0 implied volatility which is already known, we compute the first 
order correction exactly at all strikes from the scalar coefficient of the heat kernel 
expansion. Furthermore, the first correction in the heat kernel expansion gives the 
second order correction for implied volatility, which we also give exactly at all strikes. 
As an application, we compute this asymptotic expansion at order 2 for the SABR 
model and compare it to the original formula. 


Keywords Stochastic volatility © Asymptotic expansion - Implied volatility - Heat 
kernel - SABR 


1 Introduction 


The most known model for pricing derivatives is the Black-Scholes-Merton model, 
where the underlying is supposed to follow a geometric Brownian motion. Popular 
extensions include local volatility models and stochastic volatility models. As an 
example the SABR model [6] combines the local volatility of the CEV model [4] and 
a lognormal volatility process. Closed formulas for European options can be obtained 
for a few models; it is the case of the СЕУ model or for a stochastic volatility example 
the Heston model [10]. These are however special cases and there are generally no 
closed form formulas. Finite difference methods or Monte-Carlo simulations can be 
used to price derivatives. Approximations have also been computed to achieve faster 
pricing, especially for calibration processes. 

For short maturities, Hagan, Kumar, Lesniewski and Woodward provide an 
approximation for the implied volatility of the SABR model they introduce [6]. 
Berestycki, Busca and Florent [2, 3] and Henry-Labordére [8] give general methods 
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to compute short maturity asymptotics of stochastic volatility models. These 
expansions give the implied volatility at first order in maturity. In addition some 
quantities are approximated by their value at the money, which can produce errors 
in the wings of the distributions. 

In this paper, we leverage on the heat kernel methods introduced for the study 
of stochastic processes by Varadhan [11, 12] and used for the SABR model in 
[7, 8]. Using the heat kernel expansion of DeWitt [5], we provide a method to com- 
pute exactly a Taylor expansion of the implied volatility at all strikes. The stochastic 
volatility diffusion is formulated as a diffusion on a Riemannian manifold. The geo- 
desic distance gives the implied volatility at null maturity. The multiplicative factor 
of the heat kernel expansion provides the first order (in time) correction to implied 
volatility. The first corrective term of the heat kernel is translated into the second 
order correction to implied volatility and similarly for higher order corrections. We 
perform a detailed computation up to order 2 of the Taylor expansion in time of 
implied volatility, without other approximations. 

More generally, our method can be used to approximate a stochastic volatility 
model by an other model for which a closed form solution exists, with an implied 
parameter computed as a Taylor expansion. 

As an application, we compute the asymptotic SABR volatility at order 2 and 
compare it to finite difference method results and to the original SABR expansion. 

Our results can be useful for pricing short maturities options or even long matu- 
rities options with low volatility of volatility. When the approximation is not valid, 
a numerical method such as a finite difference method (FDM) has to be used. When 
our approximation is valid, it gives much faster results. At very short maturities, the 
prices are even more precise. Calibration at short maturities appears to be more stable 
using this approximation. 

In Sect.2 we recast the financial model in physical and geometric terms and fix our 
conventions. In Sect.3 we use a heat kernel expansion to compute a short maturity 
expansion of Black or more generally CEV implied volatility. Finally in Sect. 4 we 
apply the method to the SABR model and compare the results to FDM and to the 
original formula. 


2 Diffusion Equation in Covariant Form 


A stochastic volatility model for some asset with pure diffusion (no jumps) is 
described by two risk-neutral processes: the asset price S and a variable V which 
describes the stochastic part of volatility. In the Heston model V would be the vari- 
ance whereas in the SABR model it is a factor of volatility. The diffusion is given by 
the stochastic differential equations 


dS = us(S)dt + os(S, V)dW| 
dV = py(V)dt + ay (V)dW2 (1) 
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where dW; and dW) are two standard Brownian processes with correlation p. 
The dependence of parameters in variables S and V we have written is the more 
common, it may be more general with all parameters depending on both variables. 

Stochastic volatility models can be seen as diffusions on a Riemann surface. More 
precisely, prices of securities are sections of a line bundle over this Riemann surface 
which are solutions of a diffusion (or heat) equation. 

A introduction to this subject and its applications to finance can be found in [9]. 
We present here the formalism and define all quantities we use in order to set our 
conventions. 


2.1 Diffusion Equation 


Let us consider a general model with n state variables X i(t) (which will be the 
spot and the volatility) which follow a pure diffusion process, without jumps. For 
simplicity we consider a European payoff of some maturity T. The price P(X (t), t) 
of such a payoff is the solution of a diffusion equation 


r hoss 
-3P = Р + SE дд;Р —rP (2) 


where X! is the covariance matrix, ш the drifts and r the numéraire rate. All coef- 
ficients can depend on state variables X'(r) and time t. Unless explicitly staten, 
we adopt Einstein sum convention: repeated indices are summed. The price of the 
European option is given by the solution of this equation with terminal boundary 
condition at maturity T given by the payoff. 

The covariance matrix X can be seen geometrically as the inverse g^ = X of 
a metric gj; on the space of variables. The diffusion equation describes the diffusion 
over a Riemannian manifold: the state of variables endowed with the metric gi; = 


(27; 
Examples 


1. The Black-Scholes equation in the monetary account numéraire with volatility с, 
risk free rate r and dividend yield q reads 


1 
—OÓ,P = (r —q)SOsP + 57 S 8süsP —rP 


This is Eq. (2) with u$ = (r — 4)5, X55 = 0252 andr =r. 
2. For the stochastic volatility model described by Eq. (1), j/ is a two-dimensional 
vector 
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The covariance matrix X7" is 


oal 9$ posov 
pasay O y? 
In what follows we restrict ourselves to the case of time-homogeneous mod- 


els: there is no explicit time-dependence in parameters. The generalization to time- 
dependent cases is not difficult. 


2.2 Gauge Structure 


There are several gauge transformations which are natural for such systems: 
1. Change of numéraire: 
Р(Х, 1) 


P(X,t) — "TERT 


where Ф(Х, t) is the price of a security which is always nonzero. Mathematically 


itis areal function which is positive everywhere, that we denote thus by (X, t) = 
—(X,t) 
e . 


2. Change of variables 
X —— X'(X). 


The natural way to handle a system with gauge freedom is to introduce covariant 
derivatives. The coordinate freedom is handled through the Levi-Civita connection 
which acts respectively on scalars, vectors and 1-forms as 


Dif = Oif 
D; f! = 0 f! x Tj f" 
Di fj = ôi f; ГЕ f 
where pz are the Christoffel symbols. The action on tensors with more indices is 


obtained by acting on all indices with the Christoffel symbols. Christoffel symbols 
can be computed from the metric as 


1 
r5 = 59" (00) + дуди — д). 


A fundamental property of the Levi-Civita connection is the covariance of the metric: 


Digjx = 0. 
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The metric is used to transforms vectors into 1-forms and conversely, i.e. lowering 
or raising indices: 


Ai = gil Aj 
Ai = gij Al. 
The numéraire gauge freedom is handled through a line bundle £ (i.e. with sec- 
tions in R). Geometrically, P is a section of £. A R-valued connection! is defined 


with spatial and time components given by a 1-ѓогт А; and a scalar? О: 


Vi P = (Di — Ai)P 


Under the change of numéraire 
Р — её) р 
these operators аге covariant, 


УР — e& Dy p 
МР — e Dy, p, 


provided that A; and Q are shifted as 


Ai — Ai — ;ф 
Q — Q- ðo. 


Using these connections, the diffusion equation (2) can be rewritten as 
Loi 


Identifying terms between Eqs. (2) and (3), the R connection must be 


js | 
Ai — gij (-zri = s) 4) 
1 ij (; k 
О= =й (a4; – АА; — Tl AC) +r. (5) 


l This connection is similar to the connection which described the electromagnetic potential, except 
that the fibre of the gauge bundle is R instead of U(1). This causes a difference of a factor i in 
equations. 

>There is a breaking of symmetry between time and spatial directions. The diffusion equation can 
be seen as a non-relativistic limit of a pure wave equation in imaginary time. 
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In addition with " е 
g^ exu 


this translates the set of financial parameters into geometrical quantities. 


2.3 Kolmogorov Forward Equation 


The Kolmogorov backward Eq. (3) leads to a dual Kolmogorov forward equation. 

We suppose that all prices are expressed with respect to a numéraire which is a 
traded asset that does not pay any coupon or dividend. The price of the numéraire 
security itself is identically 1; this reads mathematically 


1. 
-V,l=<=V'Vil 
2 


If p(X, t) is a risk-neutral probability density to get in state X at time f starting 
from state Хо at time 0, then the price of a European payoff of maturity Т > t can 
be written as 


Р(Хо, 0) = [expe t)P(X,t). 


As t does not appear on the left-hand side, the derivative of the integral with respect 
to f must vanish. 


| хоо, t) P(X,t)) = 0. 


We define an action of the gauge group on р with a plus sign instead of a minus sign 
when acting on P: 


Vip = (Di + Ai)p 
Vip = (д, + Q)p. 


This means that they p and P have opposite charges under the numéraire R gauge 
group, such that pP is neutral and V; (pP) = 0;(pP). We have thus 


Јах (У,р(Х, t) P(X, t) + p(X, 1) У, P(X, t) = 0. 


Using Eq. (3) for У, Р(Х, t) and integrating by part on the spatial directions, this 
equation becomes 


Јах (vo, i= SVIVipQX, n) Р(Х,1) = 0. (6) 
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This equation will be automatically satisfied if 
loi 
У,р = М Vip. (7) 


Moreover, if the market is complete Eq. (6) must be true for all functions P (-, t) which 
imposes Eq. (7). This is the Kolmogorov forward equation, written in a covariant way. 

It should be noted that p(X, t) is a density, which means that the Levi-Civita 
connection does not reduce to a partial derivative as would be the case for a scalar. 
More precisely, the transition probability р(Хо, 0; X, t) has value in £ X L* & 
^T (T* M). Numéraire gauge tranformations associated with the line bundle £ acting 
on p gives the well-known change of measure which are usually obtained from the 
Girsanov formula. 


3 Asymptotic Implied Volatility 


We consider a stochastic volatility model where the variable is a forward price or 
rate F with a volatility variable V: 


dF = ор(Е, V)dW, 
dV = py(V)dt + oy (V)dW; 


with (аиа) = pdt. 
Our computation of an asymptotic expansion at short time of implied volatility at 
strike K involves four steps: 


1. Compute an asymptotic value of the transition probability from initial state Fo, 
Vo at time 0 to K, V at time f using a heat kernel expansion; 

2. Compute Son (K, V)O(F — K)] using a saddle point method; 

3. Integrate over time to compute the time value; 

4. Compare to the same formula for the Black-Scholes model to extract the implied 
volatility. 


3.1 Heat Kernel Expansion 


Inorderto keep exposition as simpler and clear as possible, we will skip here technical 
details and refer the reader to [5, 13] for mathematically precise statements.? 


3In finance we will usually consider noncompact manifolds, possibly with boundaries as in the 
SABR model for 0 < 6 < 1. 
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At short time the solution of Eq. (7) with initial condition p(X, 0) = ó(X — Xo) 
is asymptotically given by a heat kernel expansion? 


JAX) | d" (Xo, X) 
PK D = on И As aV AQ. Р(Х Хе A Maio XO. (8) 
k>0 
g(X) is the determinant of the metric at point X: 
g = Det(gij). 


d(Xo, X) is the geodesic distance between the starting point Хо and the end point 
X, this is the minimal distance between Хо and X. It can also be written as 


d?(Xo, X t AU 
4 (Хо, X) = min | dt gij X' X! 
t X(s) Jo : 


where the minimum is taken on all paths going from X(0) — Xo to X(t) — 
(This is independent of t.) We denote by C this geodesic path. A(Xo, X) is the Van 


Vleck-Morette determinant 
Lied? (Xu. X 
Det Lora (Xo, X) 
“2 axiax! 


v g(Xo)gCX) 


Р(Хо, X) is the parallel transport along the geodesic with respect to the R connection. 
It is such that its covariant derivative along the geodesic path is null: 


- | лоха - | лах! 
Р(Хо, Х) = е YC —e JC 


where the integral is computed on the geodesic path C. Finally, a; (Xo, X) are func- 
tions which are defined recursively with 


A(Xo, X) = 


dag = 1 


4Using Feynman path integral, the solution to Eq. (7) can be written up to some normalization factor 


as 1 
П ayes 7" 
[al gij X X! 4 A,X‘ 4 о) 
рох « архе 0 2 


where [DX] means integrating over all path X (s) going from X(0) = Xo to X(t) = X. The 
normalization factor is the inverse of the same quantity with the integral computed over all paths 
with starting point Xo, so that the total probability is 1. It is generally not possible to compute 
this integral exactly. However it gives some hints on the asymptotic solution at short time: the 
solution will be dominated by the path corresponding to the minimal value of the integrand inside 
the exponential, which will be close to the geodesic path. 
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and a;’s satisfy the differential equations 
з | ЕР 
(k + (Víd)aVi)a, = РА? бА = о) Рл!?а_1. 


Along a given geodesic curve parameterized by its geodesic distance from Хо, this 
equation reads 


T: 
(k + 4да)ак = plate бА — о) РА!?а 


which can be integrated as 


1 f? lon 
a= x] dig Pp AE (zv - 0) PA! 
0 


Functions a; are sections of a £ X £* bundle. The parallel transport P and the 
connexion with respect to the numéraire gauge group act on the second factor of this 
external product. (The first factor is related to the numéraire at f = 0.) Also note that 


ML is a scalar with respect to the Levi-Civita connection. 


In order to produce a first order expansion of the implied volatility, only the com- 
mon multiplicative factor of expansion (8) is needed. In order to compute a second 
order term for the implied volatility, we will also make use of the first corrective term 
aıt with 


1 ff ln 
ау = “| GPA’ бА — о) AUS (9) 
d Jo 2 


3.2 Expected Variance 


We now compute Bon (Fs, V)ó(F, — K )]. This quantity сап be written as an inte- 
gral over the terminal volatility variable У: 


iei. V)&(F, — к = n o2.(K, V)p(K, V; t) 


where p(F, V; t) is given by the heat kernel expansion (8) with X = (7) and 
n — 2. The integrand can be written as 
B 
—— —C-— Dt- o(t) 


1 
2 
K,V)p(K,V;t)— ——e t 10 
og )p( ) FT (10) 
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with 
В = 5d Vo; К, У)? (11) 
1 
C = —2In(or(K, V)) — 5 [In(g(K, V)) + In(A(Fo, Vo; К, V))] + MK, V) 
(12) 
D = —ay(K, V) (13) 


where M is the integral of the R connection 
M(K, V) = - In(P(K, V)) = n A;dX! 
C 


on C, the geodesic curve joining (Fo, Vo) to (K, V), and a, is given in Eq. (9) as an 
integral over the geodesic path. 

The integral over (10) will be dominated at short time by the B term. More 
precisely, it will be dominated by the volatility Vmin which minimizes B(K, V) = 
id (Fo, Vo; К, vy. This is the final volatility which minimizes the distance between 
the initial conditions and the strike K. Expanding all functions in the neighborhood 
of Vmin, where B’(Vmin) = 0, the integrand is 


B p" 
Du Yin Yin e t e 2 
2тї 


4 6 
1 EN 1 (6° _ c?) ву? _ lgo _ lOc ôV ЕЙ 1 po? ôV 
2 24 6 t 72 12 


+ o(t) + odd tems 


where derivatives are with respect to V, all functions B, C, D and their derivatives 
аге taken at (K, Vmin) and ôV = V — Vmin. When writing o(t), we have anticipated 
that after integration 8V? ~ 1. We have also anticipated that odd terms іп dV will 
not give contributions to the integral. 

Integrating over dV, and using that the first even moments of the standard normal 
distribution are М» = 1, M4 = 3 and Mg = 15, we get for the integral 


B 
1 ->-C-Dt 


6 
X 2nt B" 


1 „ D t 1 (4) 1 (3) / 3t 1 (3)2 15t 
| ; (€ E Ie (5 o^ С] ватт? gasto 


|2208, 0008 — K)] = 


Asymptotic Implied Volatility at the Second Order ... 47 


This can be rewritten as 


Boon xm 
== —C- Dt t o(t) 


eid. V,)6(F; — к) = e t (14) 
2rt 
with 
D 1 „ 
C =C + 5 In(B") (15) 
2 
2 1 1g? ВӘ) 5 ( BO 
Б=р+ c" eq e 
2B" 4 B" В” 12\ B" 
L les ce 1В® Ife : 
ато ш = лд al (16) 


where all derivatives аге with respect to V and all functions and their derivatives are 
taken at (К, Vinin). 
3.3 Time Value 


The price of a Call of maturity T and strike K can be written as the payoff integrated 
against the risk-neutral distribution: 


Call(K, Т) = e77 / dF (Е — К) p(F, T) 


where p(F, t) is the marginal probability density of F;. This can be written also as 
a double integral over forward and time as 


T 
Call(K, T) = eT G — К) + Jj ar | dt (Е — K)*Op(F, n| . (17) 
0 


As F; is a forward, it is a driftless process and the Kolmogorov forward equation 
reduces to 


1 
9 CF. 0 = 595 (si CF DPED) аз) 


where о}, «СР, D is the local (normal) volatility 


[02 (5, VOC — F)] 
p(F, 1) 


ei G1) = El oF (Fi, VL = F| = 
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Plugging the Kolmogorov equation (18) in Eq. (17) and integrating twice by part on 
the F variable, the Call price is finally obtained as an integral over time at strike K: 


T 
Call(K, Т) = e7'7 G -Ky 4 >| dt iei. V)&F, — 3l . (19) 
0 


Using expression (14) for the integrand, the integral over time can be computed: 


T 
sf dt |0208, 0908, — K) | = 
i cele ee [8 
z "E 4 E 3 
B B 


D t -— B —— 
m | —(t — 2B)e Caste (г) +oļ| Pe t 
iG 


where erfc is the complementary error function, equal to the cumulative of the stan- 
dard normal distribution up to A/2 factors: 


+00 5 
erfc(x) = e dye" = 2N (- 2x) | 
T Ix 


The asymptotic expansion of this function at +00 


ы pU d "na 
erfc(x) — 
b хут 2х2 4x4 Ou 


with x = ,/ В gives the asymptotic expansion of the time value: 


1 [Т ә 
5 | a Blake. vos — 1] 


T2 E In(B) — DT 3 TT) 
Т 2В , (20) 


3.4 Implied Volatility 


The final step consists in computing the same expansion for the Black-Scholes model, 
which is simpler as there is no stochastic volatility to be integrated. The metric is 
given by the inverse of the variance: 


1 


gFF = cipe 
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The Christoffel symbol is therefore 


1 
r£,--—. 
FF F 


The R-connection components are computed using (4) and (5): 


The geodesic distance is 


d(Fo, К) = 


1 
Fy OF Шу 


The Van Vleck—Morette determinant is simply 


A(Fo, К) = 1. 


The parallel transport is 


K 
-f dF Ar F 
P(Fo, К) =e “Fo тү? 


Putting all these elements together, the heat kernel expansion of p(K, t) is accord- 
ing to (8) 


1 = o? 
Eom zie E 
рК, 1) күк 8 (f) 


Multiplying by the local variance, we get 


Bss x = 
1 = = Css — Dpst + o(t) 

T2 2р2 

iesaoscr — К)| = о2к (К. = e d 

| д d JV 271 

(21) 
with 
B — 2 


~ 1 
Cps = —In(o) — 5 In(K Fo) 


быз 
BS = 8 
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(In fact formula (21) is exact: there is no o(t) correction and it can be integrated 
exactly to get the Black-Scholes formula.) 

Writing Eq. (20) for both the stochastic volatility model and the Black-Scholes 
model, the implied volatility is such that both quantities are equal: 


_ " 3 B E = 3 
+С+щ(В)+ DT + —Т = + Свз + Ш(Вв$) + DgsT + —T + o(T). 
2В Т 2Bps 
(22) 


ч|% 


Expanding the implied volatility с as a Taylor expansion 
o(K, T) = 00(К) + c1 (K)T + о2(К)Т? + о(Т?) 


and plugging this into Eq. (22) on the Black-Scholes side, we get 


1 K : 
: w( ) 1—27%т —2°? т?+3 (2) т? елау Т 
200T Fo 00 00 00 00 


на <a P gu S ya 300 y 
— = ІП П П 
2 ^ 202 Fo 00 8 In? K 


B - ~ 3 
= — +C +ln(B)+ DT + —T T). 
T СЕВ) ост 
Coefficients must be equal at each order in T, which gives our final expansion of the 
implied volatility. 
Power — І gives the order 0 implied volatility 


K K 
00 — = 
42B . d(Fo, Vo; K, Vmin) 
which was already obtained in [2, 8]. 
The first order correction is extracted from the constant term: 
o C + In(co4/ K Fo) (24) 


OQ 2B 


Finally the O(T) term gives the second order correction: 


2 2 

02 3 (ai 1 ~ С] 00 
= 0 +3— — — |. 25 
00 2 (2) »( т 00 8 ) (3) 


This gives our final result as the implied volatility expansion 


g=oo(1+ 2742774 0(7%). (26) 
00 00 
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We stress that this result is exact in strike: for a given strike, we have computed 
exactly the three first coefficients of the Taylor expansion. Moreover, contrary to 
other expansions, the order 1 expansion is extracted from the order 0 expansion of the 
probability. This technique allows us to extract a second order term for the implied 
volatility from the order 1 term in the probability expansion. This method can be 
used to compute the Taylor expansion of implied volatility up to any order, although 
the computation becomes more complicated and involves integrals of increasing 
dimension: the a, coefficient of the heat kernel expansion involves k + | integrals. 


3.5 At the Money 


The computation we have performed makes the implicit hypothesis that we are not 
exactly at the money: К #¢ Fo. Otherwise, the dominant term in the exponential 
would vanish and we could not use the asymptotic expansion of the erfc function at 
infinity. Precisely at the money, we should use instead a Taylor expansion in 0. As 
the implied volatility surface is smooth, we just take the limit of formulas (23), (24) 
апа (25) at K — Fo. If we perform instead the Taylor expansion of the erfc function 
at 0, we find only the two first orders 


e CC» 


Fo 
оер 
“ry = $(3 bur). 


Careful Taylor expansions of all quantities at the money can be used to check that this 
is indeed the limit of Eqs. (23) and (24). Moreover, it can be seen that the existence 
of these limit are conditions for formulas (24) and (25) to be convergent, as B goes to 
0 at the money (at order 2 in the geodesic distance, which means that the numerators 
must in fact vanish at order 2). 


со(Ео) = 


3.6 CEV Volatility 


Instead of Black volatility, the asymptotic expansion can be computed for other 
local volatility models. Without stochastic volatility, the SABR model reduces to the 
CEV model. The local volatility part of the model is thus taken into account exactly 
without introducing approximation besides the stochastic corrections. In view of our 
application to the SABR model, we will compute here a CEV implied volatility. 
There are closed formulas for this model, involving Bessel functions. This implied 
volatility can therefore be used in the CEV pricing formula in order to get the price 
of the option. 


52 
For a СЕУ model with parameter Зо and volatility factor с, such that 
dF —oF^dW, 


the function В, C and D are 
1 2 
Во = 552 In* (qo) 
= 1 
Co = – (о) — 580 In(K Fo) 


~ Bo(2 — Во)с? 
i= 1—80 p1- o 
8K 1-0 F) 


with 
f 1—8 
K! б — Fy po 
— — <l 
40 = 1 — Во 
K 
In| — = 1. 
(5) mi 


Formulas (23), (24) and (25) are modified as follows. 


Г] = |901 
A/2B | d(Fo, Vo; K, Vmin) 


со = 


c; С+ In(eo) + 550 In(K Fo) 
00 2B ` 


o З(оү 1 {— „оу  Bo(2— Boog 
= D+3 0 |. 
00 2 Noo 2B oo  8Kl-fo Е] Bo 


This gives the CEV implied volatility expansion 


o=oo(1+ 2742774 0(7%). 
00 00 
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(27) 


(28) 


(29) 


(30) 


The Black implied volatility formulas correspond to the special case до = 1. The 


Bachelier (i.e. normal) implied volatility would correspond to до = 0. 
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3.7 Generalization 


This technique can be generalized easily to other parameterizations of the options 
prices. Consider a model with local volatility or stochastic volatility, for which there 
are closed form formulas for European option prices. It can be used as a proxy in the 
following way. 


Denoting by z; the parameters of the model, compute B. (2;), C. (zi) and D. (zi), 
the quantities B, С and D of the asymptotic expansion (20) for this model at a 
given strike. 

Find parameters z® such that B, (20 ) = B (there can be several solutions). 
Choose a one-dimensional subset of the parameters z; = z;(A) which allows a 
wide range of option prices at the given strike and such that z; (0) — 2. 
Compute derivatives of B,.(z;), C,.(z;) and D,(z;) with respect to А at А = 0. We 
use the notation В, = B,(zi(0)), В, = ду Bx (zi (A)) i-o ... 

Write a Taylor expansion A(T) = АТ + АТ? + o(T?) and write the equality of 
the asymptotic expansion (20) for the model and the proxy model: 


В, + A1 T Bi XOT? Bi + AIT? BY 
Е Т 
2 T - 3 
+С. АТС, + (В, + A:T BL) + Dy + op s + o(T). 
* 


B ~ ~ 3 
= -C-cIn(B DT T 
zT + In(B) + Tum 


This gives the Taylor expansion of A: 


_ C С, + In(B) – In(B,) 

= 5 

b- B, - X - X0, - Dia] 
B! f 


* 


Л 


БЕ 


Plug parameters z; (А Т + A2 T?) into the closed form option price of the proxy 
model to get an approximate price of the option in the real model. 


The closer the models are, the better the approximation is. It is clear that if the proxy 
model is the real model itself, there are no corrections at all. This procedure consists 
in approximating only the differences between models at a given strike and not the 
option price itself. In the basic case of Sect. 3.4 where the proxy model is the Black- 
Scholes model, the approximation leverages on the fact that the volatility surface is 
more regular than the option price. 
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4 SABR Model 


4.1 Model 


The SABR Model [6] is a stochastic volatility model where the volatility is a local 
volatility function multiplied by a lognormal stochastic volatility: 


dF = VC(F)dW| 
dV =vVdW2 


with (dAW1d W2) = pdt. The initial value for V is the parameter? a: 
а = V(0). 
C(F) is a local volatility function, which is generally 
C(F) = Е 


B is а number between 0 and 1 which controls the local skew. 0 corresponds to а 
normal process and 1 to a lognormal process. The implied volatility at time 0 and at 
the money is the local volatility AF. 

Depending on the parameters, the origin F = 0 could be reached with finite 
probability in finite time. For example this happens for the CEV process (i.e. even 
without stochastic volatility) for 8 < 1.If F models а positive variable, a boundary 
condition must be imposed. The asymptotic expansion does not distinguish between 
different boundary conditions, as the computation is local around the geodesic path. 
Itis valid as long as this geodesic does not reach the boundary. However the maturity 
validity range may be reduced for low strikes, when the probability of reflection or 
absorbtion at the origin modifies the probability distribution at the strike considered 
in a significant way. 

In the following sections, we compute the asymptotic expansion for the SABR 
model. This short maturity expansion is valid when both a*T and v?T are small 
enough in front of 1. If ones uses CEV implied volatility instead of lognormal implied 
volatility, the expansion is in v?T only. Numerical experiments indicates that the 
approximation remains very good for 2T < 1. 


We use the standard notation of o for the initial value of the volatility variable in the SABR model 
instead of Vo as in the previous section. 
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4.2 Order 0: Metric 


In order to compute the order 0 implied volatility, the only geometric object involved 
is the metric. According to the dictionary of Sect. 2.2, its inverse is the covariance 
matrix 


i) = V?C(FY pvV?C(F) 
(9 ~\ pv2c(F) 2v? J 


This matrix is first simplified by changing the variable F to 


-[ a (31) 
1= Ja OU 


which for C(F) = Е? reads for 8 Æ 1 
1-£ 1—8 
и гы 
1—8 


and for 8 = 1 


1 F 

= Inj — |. 

q Fo 

In addition, we rescale the time such that v disappears of the equations while 


keeping the same solution of the equations (the variances which are the physical 
quantities are not changed): 


At the end of the computation, the inverse transformation must be applied to the 
implied volatility: 


OY q= g 


The matrix in the set of variables (q, V) after this rescaling is 


()«n( t) 
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This is diagonalized by going from variables (9, V) to (x, y) with 


9 -—pV 
pou ME E AN 
v1- p 

y — V. 


and its inverse is the metric 


(v) = s (5 1) 


which corresponds to the infinitesimal distance 


ds? — Rui da 

y 
This geometry corresponds to the hyperbolic plane, in the Poincaré half-plane rep- 
resentation (y > 0) [7, 8]. Geodesics are vertical lines and semi-circles orthogonal 
to the y = 0 axis. The geodesic distance between two points (x1, y1) and (x2, y2) 
can be computed: 


(x2 = x1)? + (2 — nr) 


4(х\, yii х2, y2) = eost (1 4 
2y1y2 


In the (q, V) variables, going from q = 0, V = a to q, V the geodesic distance is 


2 2 
y- — 2pq(V — 
40, оса, V) = cost! (1.4. 4 t al pat 2), 


2(1 — p2)aV 


For a given strike, i.e. a given q, it is minimized by the volatility 


Vmin = y a? + 2paq + 42 


and the minimal distance is 


RE — 72 т 
d(0, o; 4) = cosh7! (ene) = (mm) . 


(=p a (1+ р)а 
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Equation (23) gives the order 0 implied volatility 
K 
In| — 
Fo 


(m 
(1+ p)o 


со = 


(We have dropped the absolute values as the numerator and the denominator have 
the same sign.) 

Plugging the expression for Vmin and going back to the original time, with v 
factors, the order 0 implied volatility for the SABR model is 


K 
v Inf — 
(5) 


ср = (32) 
In Va + 2pavq + 24? + pa + qv 
(1+ p)a 
with 
K'-8 = p^ 


At the money, the limit of this expression is simply 


8-1 
oo(Fo) = aF) 


which is the local volatility. 


4.3 Order 1: Connection 


To compute the order 1 correction, we need the scalar factor in the time value expan- 
sion, given by C in Eq. (15), with C given in Eq. (12). 
For the hyperbolic plane, the Van Vleck-Morette determinant can be computed 
as a function of the geodesic distance: 
d 
^ sinh(d)' 


We need also 


or(K, V) = VC(K) = VK? 
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(К.У) = 1 Е 1 
JU 7 yickya-p) Vk р?) 
Using that d” = 0 for the minimum distance at V = Vmin, we compute 
B" = dd" = d 
aVinin(1 — p?) sinh(d) 
Terms simplify against each other to give at V = Vmin 
C = —In( а K^) +M (33) 


M is the integral of the connection 1-form on the geodesic. According to formula 
(4), the connection is given by 


_ pC'(F) 


In fact, A can be rewritten in (x, y) variables as 


С'(Е) 


—— d 
2/1— p ш 


A= 


It must be integrated from 


If 8 = 1, the 1-form 


is exact and can be integrated directly: 


q-pVt+tpa 1 K p K 
= E] 1 -V | 
a 2 (1 — 02) 2 (=) е 2(1— ø) (> (7) +a) 


We consider now the general case where 8 < 1. If x; = x2, the geodesic is a 
vertical line. As A is along dx, its integral is zero: 


M=0. 
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In other cases, the first part of Eq. (34) is an exact form and can be integrated directly: 


i us ETT 1 (202) Е 8 (5) 
Toa) 2(1— р) © 21-) XO0)J 30-9) FJ" 


(35) 


The second part must be integrated on the geodesic path. The geodesic is a semi- 

circle with origin (X, 0), radius А and going through (x1, x2) and (yi, y2). The origin 

18 therefore 

xy —хү+уў—Ууї 
2(x2 — x1) 


R= (| у? + (ху – XY. (37) 


We parameterize the geodesic by г = tan(0/2) where 0 is the angle on the circle: 


X= (36) 


and the radius 


gag tet 

b du — 

14:202 
_ 2t 

TU gm 


In this parametrization, the geodesic distance is given by 
ds = dIn(t) 
and we can compute 


pBFP p^ G pB 
= 0) Е) Gop fap 00 - 60 
(38) 


with 

G(t) = tan! (t) 
a 4- bX uA SETEC TEX- R) 
an а 

Ja bX (PZR Va bx)? — 1 = 8)? R? 
| + bx (а + bX)? = (1 — 8)?R? 
cR 4- t(a 4- b(X — R)) 

а+ЬХ uc cR -F t(a 4-b(X — R)) 

Jd — BRI — (a +ЬХ)? Va- BPR? – (a+ bXY 


bX)? -(-g8?Rgp? 


bX)? -—g8?Rg? 


(39) 


60 L. Paulot 


1-8 


а = Fy 
b= (1-81 2 
c = (1— 8)p 
В – х; +X 
ti = iani аа (40) 
R+x—X 
and i i 
FZ 
tanh !(z) = = 1 
an (z) т А 


which coincides with the inverse function of tanh on ] — 1; 1[. Summing Eqs. (35) 
and (38), the integral of the connection is finally 


8 (5) об 
М= =1 С Gl. 41 
n сөл 1909) - Gn) (41) 


Replacing M in Eq. (33) we get 


[G(z) - G())) 8«1 


~ 1 | / —.8)./1 — p 
€ = —5 In(aFy Vais K^) + кеды к= 


2 ) (rm) vas +a) В = 1 


2 (1 -P 

(42) 

Restoring the factor v, the order 1 correction is given by Eq. (24): 

Č + in(  /K Kj) 

el esed 4 (43) 

90 id Va + 2pavq + v?^q? + pat qv 

n 
(1+ p)a 


со is given by Eq. (32), C by Eq. (42) (with о divided by v, also inside Vmin), where 
X, R, t4 and f» are given in Eqs. (36), (37) and (40) from 


—pa vq — paz + 2pavq + v2q2 
cy- [Pe] «c ae 
n = y2 Va + 2pavq + vq? 
и 
и 


and С (t) is defined in formula (39). 
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Using this expression, the first order implied volatility is 
с = 00 ( + e + on) 
00 


which is valid for all positive strikes. Exactly at the money, the formula we give must 
be replaced by its limit, which can be computed by a Taylor expansion or numerically. 
At the money and only at the money it appears to be equal to the original HKLW 
formula: 


С 1 205-1) 1 8—1 1 1 
= FO) = ager BY Fy? + paw B Fy тп ы H 


This is not surprising as their expansion is in fact an expansion in both maturity and 
moneyness (eventually of order 0in moneyness). 


4.4 Order 2 


To compute the second order correction to implied volatility, we need to compute D 
as defined in Eq. (16), with D = —a; defined in Eq. (9). 

We have to compute a; as defined in Eq. (9). Most of the integration can be done 
analytically. We have first the integral of Q along the geodesic: 


According to Eq. (5), Q is 


a2 B у? 
= ( EHE 21 2) F20-5)" 


Using the values defined in the previous section for X, А, t1, t2, a, b and c, its integral 
along the geodesic is 


2 = 
0) _ 8 (1 B+ = B ) R H(t) — H(t) (45) 


ie — 9) (1 BR? — (a +ЬХ)? In(r;) — nÇ) 
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with 


a 4 b(R + X) + cRt 
(a+ bX)(1 +12) c bR(1 — t?) + 2cRt 


cR NET EL SRL — R) 
V(a+ bx)? — (1 — 8g? V(a+ bx)? — (1 — 8)2 2 


cR iue cR 4 t(a 4-b(X — R)) Р 2 
ЖОЛ ТП GG EXP tanh (s NUT RUE i) (а +ЬХ)? «(0— 8 R?. 


H= 


bX)? > (1 — 6) R? 


a 


+ 


Note that in the denominator, the quantity In(f2?) — ln (71) is up to a sign the geodesic 
distance d. If 9 = 1, ap reduces to 


R | x2 = x1 | 
(Q) 

== MC E 46 
Е 81 — pd (6) 


The Laplacian on the hyperbolic plane is in (x, y) coordinates 
D' D; = y (Oz + 02). 


As the Van Vleck-Morette determinant A = iml depends only on the geodesic 
distance s, its derivative on the orthogonal coordinate vanishes: 0; A = 0. On the 
other hand, by definition the parallel transport on the geodesic curve has no covariant 
derivative along the curve: У,Р = 0. As a consequence, there is no crossed term and 
both terms decouple: we have 


PAP iy (PA! = p-lyi vip + A712 pi Dal? 


(the R charge is carried only by P). 
The metric part can be integrated analytically [1, 7]: 


(R) 1 1 f'cosh(d) 1 
a= 1+ : А (47) 

8 d \sinh(d) d 
The last part to integrate is the R connection term P~!V‘V;P. As the action of 
gauge transformations on the heat kernel expansion is fully carried by the parallel 
transport term P, a; can only depend on gauge-invariant quantities constructed from 


Е = dA. We split therefore A = A® + A“) into a pure gauge part А and AC? 
such that F = dA“): 


AY = jd InCQ^) 


аш(с(куу— PO ay, 


AO = 
20 — p) 


ЕЕ: —— 
20 — p?) 
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Forgetting A® which is pure gauge (it can be checked by hand that A® will not 
contribute), we denote by Ф!) the A® part of the parallel transport: 


pO Е ala” = Је AW 
with 8 
AS 602) -G() 8«1 

MO = = E cs 


aber) 7 
20-98)V Am) +a) к=; 


Using M апа АЧ) which in (x, y) variables is 


C'(F 
Ai) = E( p = dx s) 
lp 


we can rewrite 
а I 2 2 2 (1) (1) ay)? 
5PIV'VIP = 55 – (д: +) м + (am -Al ) 
2 
+ (8,49 — AP) + aA” + oA] 
where the specific form of АЧ can be used to simplify terms 
0, A + 9А = 0. 


Computing M and A“ analytically, we compute numerically the x and y 
derivatives and integrate numerically along the geodesic curve to get 


a”) = =: |. ds y? |- (& + ә) мо + (aM — aw) 


+ (AM — дю) | К. (49) 


This integral can be computed by a numerical quadrature with few points. For 3 = 1 
the connection has in fact no curvature and therefore ab? = 0. 
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We compute also the following quantities (at V = Vmin): 


1 
Ho 
© a(l — p?) Vin sinh(d) 
B" = dd" 
B®) 3 
B" nid Vmin 
B® 12 ,(cosh(d) 1 
= 34" | — (50) 
B" ys sinh(d) d 


We need finally C’ and С”. Using our decomposition М = 5 In к. + MID, we сап 
write from formulas (12) and (15) 


4 B 1 4 1, , 
C = - 5 In(K F) 5 in( 2) + 20 ) -n(1- ø) + Mi. 


Its first and second derivatives in V at V = Vmin are 


3 
" ay 


2 B" 
2 
Eu. | cosh(d) 1 1B® 1/B® „ 
стега (1) . 
2 КЎ 2) "agp ale] т ie 


We choose to differentiate M® numerically, by finite difference, as the analytical 
expression is very long and hard to simplify. 


Du 


Simplifying С against —а(® we get finally 
~ 1 1 n 12 3 / 3 
_— 30) (A) а) а) а) 
D — —a а +5 + M =M + м 
: A 8 2B" | Vinin m 


with a(9 given in Eq. (45), a^ in (49), B" in (50) and M“ in (48) with Си) 

defined in Eq. (39). For д = 1, this expression can be simplified using Eq. (46) for 
(Q) 

a,~ and 
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We have finally obtained the second order corrective term 


2 
02 _ 3 fo 1 баз?! Е сў 
oo 2 Noo 2В 00 8 


such that we get the quadratic approximation to implied volatility 


o = 00 ( + TT 2r eom). 
0 0 


The computation has been done in redefined variables such that v — 1. To restore 
the v factors, a must be replaced by 5, T by v°T and the final implied volatility 
must be multiplied by v. 

At the money, the formula for Е looks divergent but its limit is well defined. We 
compute this limit numerically, although it could be done analytically. 


4.5 CEV Volatility 


The results of Sect. 3.6 can be used to invert the SABR volatility into a CEV fractional 
volatility. Using formulas of Sect. 3.6 the implied CEV volatility is computed and 
used in the closed-form option prices of the CEV model. 

This appears to be useful at low strikes for б < 1/2 or with small volatility of 
volatility: only the corrections to the CEV model which come from the stochastic 
volatility are approximated, not the local volatility part. For example, at the money 
the first order coefficient of the Black volatility which is given by Eq. (44) becomes 
for the CEV volatility 


т 1 8— 1 
| (Fo) = -pavb FË TE pa -p v. 
00 4 


The corrective term in o?T has disappeared. 


4.6 Numerical Results 


We present in Figs. 1 and 2 the implied volatility given by our expansion and compare 
it to the implied volatility computed by a two-dimensional finite difference method 
scheme. We also show for comparison the implied volatility given by the original 
formula of [6]. In this example, parameters are Fo = 4, a = 30%, 8 = 0.7, 
v = 40%, p = —0.5. The FDM scheme is a second order Yanenko scheme [14] 
with exponential fitting. We use 400 points in strike, 200 points in volatility and 30 
time steps. 
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Fig. 1 Implied volatility and relative error for the SABR model with parameters Fo = 4, œ = 30%, 


B = 0.7, v = 40%, р 


0.5 and maturities 2.5 yr, 5 yr, 7.5 yr and 10 yr. On the left, implied 


volatilities are plotted for our first order and second order expansions, the original formula of [6] 
and are compared to the result of a FDM solution. On the right are the relative errors with respect 


to this reference solution 
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Fig.2 Implied volatility and relative error for the SABR model with parameters Fo = 4, œ = 30%, 


B = 0.7, v = 40%, р 


0.5 and maturities 10 yr, 20 yr and 30 yr. On the /eft, implied volatilities 


are plotted for our first order and second order expansions, the original formula of [6] and are 
compared to the result of a FDM solution. On the right are the relative errors with respect to this 


reference solution 


At very short maturities, all expansions are acceptable as the expansion is domi- 
nated by the order 0 term. At first order our expansion is equal to the HKLW at the 
money but is more regular in strikes and is better in the wings as our computation 
does not involve any approximation in the moneyness. Our second order expansion is 
one order of magnitude more precise. When maturity grows, first order expansions 
lose precision but the second order remain relatively good up to 10 years, where 
v^T = 1.6. At higher maturities, the second order expansion explodes quadratically 
and finally gives even negative volatilities at very long maturity and low strikes. At 


SIn fact at very short maturities, the FDM scheme we use is less precise and less stable than this 
second order expansion, especially in the wings where the probability density is very small. 
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long maturities, a FDM or an other numerical method must be used, unless a valid 
long maturity expansion could be computed more efficiently. 


5 Conclusion 


We have presented a general method to compute a Taylor expansion in maturity 
of implied volatility for stochastic volatility models. We give exact formulas for 
the first and second order corrections. As an application, we have computed this 
expansion for the SABR model and compared it to the implied volatility given by a 
numerical scheme and to the original HKLW formula. It appears that it gives more 
precise results than the usual formula and extends the domain where a short maturity 
expansion can be used. Outside this range of validity, other methods must be used: 
numerical schemes or possibly other approximations. 

If a closer model with closed formulas than Black-Scholes exists, we provide a 
method to use this model as a proxy to extend the domain of validity of the expansion. 
It would be interesting to see the results of this method for the SABR model with a 
stochastic volatility model as a proxy. 

Obtaining exact option prices at all maturities would be a non-perturbative com- 
putation, which is a longstanding issue in theoretical physics. 


Acknowledgments We thank Erwan Curien for his interest in this work, Martial Millet, Xavier 
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Appendix: Mean reversion 


At long maturity, the SABR model is not realistic as the volatility process is a 
geometric Brownian motion. In particular the variance of the volatility increases 
linearly in time. A direct extension would be to add mean reversion to the volatility 
process, either on the volatility or on the variance. The asymptotic expansion can 
still be computed at order 2. However its domain of validity is usually reduced: 
in addition to other conditions, the maturity must be small compared to the mean 
reversion characteristic time. 
We impose mean reversion on the volatility process as 


dV =vVdW2 + «(V — V)dt 


The metric is not modified as it describes the diffusion part. Expressions for A and 
Q are modified as follows: 


pk(V — V) K(V — V) 
vV2(1— p2) FP v2V2(1 — р?) 


A = ASABR 
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1«V-Vy 1 V 1 pfBk(V —V) 
+ =к = к 7 
2v2V2(1—p*) 2 V 2v(1— g2)F1-8 


О = OsaBR + 


At first order, the integral of A along the geodesic is needed. Using the same 


notations as in Sect. 4.3 where variables have been rescaled such that v — 1, it gets 
an additional term 


V ft 
М = MSABR + pru 2 (ша!) — tan ()) x in( 2) 


The second order correction involves a one-dimensional integral which can be 


computed numerically by a quadrature with a few points, although it may be possible 
to get an analytical expression. 
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Unifying the BGM and SABR Models: 
A Short Ride in Hyperbolic Geometry 


Pierre Henry-Labordére 


Abstract In this paper, using a geometric method introduced in (Henry-Labordére 
Large Deviations and Asymptotic Methods in Finance (2015) [12]) and initiated by 
(Avellaneda et al. Risk Mag. (2002) [4]), we derive an asymptotic swaption implied 
volatility at the first-order for a general stochastic volatility Libor Market Model. 
This formula is useful to quickly calibrate a model to a full swaption matrix. We 
apply this formula to a specific model where the forward rates are assumed to follow 
a multi-dimensional CEV process correlated to a SABR process. For a caplet, this 
model degenerates to the classical SABR model and our asymptotic swaption implied 
volatility reduces naturally to the Hagan-al formula (Hagan et al. Willmott Mag. 88— 
108 (2002) [11]). The geometry underlying this model is the hyperbolic manifold 
H”+! with n the number of Libor forward rates. 


Keywords Heat kernel expansion - Hyperbolic geometry * Asymptotic smile 
formula * Stochastic Libor market model 


1 Introduction 


The BGM model [6, 14] has recently been the focus of much attention as it gives 
a theoretical justification for pricing caps-floors using the classical Black-Scholes 
formula. The basic (physical) random variables are given by the Libor forward rates 
which are assumed to follow a correlated log-normal process. As the forward swap 
rate model implied by the BGM model is quite complicated (the swap forward rate 
is not log-normally distributed), the calibration to a swaption matrix is difficult. An 
asymptotic swaption implied volatility (at the zero-order in the swaption maturity) 
was initially derived by Rebonato [17], Hull and White [9] for the BGM model. 
Despite its great success, the BGM model presents the same drawbacks as the 
classical Black-Scholes theory: as the forward rates follow a correlated log-normal 
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process, the model is not able to calibrate the full swaption matrix in/out-the money 
(in particular the caplets smile) and give a good dynamics to the Libor rates. The 
incorporation of a swaption smile can be obtained by introducing more elaborated 
models which should be flexible enough to calibrate caplets and a grid of swaption 
volatilities (not necessary at the money) across all swaption expiries and underlying 
swap maturities. One property that these models must still share is their ability to 
quickly calibrate the swaption matrix without using complicated numerical routines 
such as Monte-Carlo simulation which are usually noisy and time-consuming. In this 
context, Andersen-Andreasen introduced the CEV Libor Market Model (LMM) [1] 
which assumes that each Libor forward rate follows a CEV process, and showed how 
to obtain an asymptotic swaption smile. Their method is still based on the Rebonato 
"freezing" argument which consists in assuming that the ratio of a forward Libor rate 
over the swap rate and the derivative of the swap rate according to a forward Libor 
rate are almost constant (and therefore equal to their values at the spot). 

Recently, for this specific model, Kawai found a more accurate asymptotic formula 
using the Wiener chaos expansion [15]. Although giving more flexibility than the 
BGM model, the CEV LMM model is still not able to calibrate the swaption matrix 
for in/out strikes and in this context, we are naturally led to use stochastic volatility 
LMM. The literature on this subject is not particularly large. Andersen-al introduced 
a LMM where the Libors follow a multi-dimensional correlated CEV process cou- 
pled (but uncorrelated) to a Heston model [2, 3] and recently Piterbarg modifies this 
model by allowing the model parameters to be time-dependent [16]. Using an aver- 
aging principle, which consists in replacing the time-dependent parameters by some 
effective constant parameters, Piterbarg derives an asymptotic volatility. Note that as 
these models are uncorrelated to the stochastic volatility, the swaption fair value is 
simply given by the fair price in the case of a local volatility model conditional to the 
stochastic volatility process as explained by the Hull and White decomposition [10]. 
An asymptotic expression can then be generated by approximating the moments of 
the volatility process [2]. 

For pricing exotic options (such as Bermudan swaptions for example), itis simpler 
or more natural to model directly the forward swap rate with a stochastic volatility 
process. For example, the SABR model [11] was introduced to fulfill this goal. An 
asymptotic swaption smile formula (at the first-order) was derived for this specific 
model and helps to calibrate quickly the model to liquid market data. In this con- 
text, it is natural to try to reconcile/unify both benchmark models, the BGM and 
SABR models. We therefore introduce a LMM where the forward rates follow a 
multi-dimensional CEV process (with one beta for each Libor forward rate) corre- 
lated to a SABR model. As it is the case for the SABR model, we impose that the 
Libors are correlated to an unique volatility and it is therefore not possible to follow 
the Andersen-al [3] method (i.e. the Hull and White decomposition) to derive an 
asymptotic swaption smile. 

In this paper, we pursue our previous work on the application of the heat kernel 
expansion on a Riemannian manifold endowed with an Abelian connection [12] 
to derive an asymptotic smile formula for a swaption. The plan of this paper is as 
follows: in the first part, we will recall some definitions and present a list of recent 
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Libor Market Models. In the second part, we apply the heat kernel expansion to 
derive an asymptotic swaption smile formula at the first-order valid for any LMM. 
In the third part, we present our stochastic LMM and apply this general formula. 
We will prove that the geometry underlying this model is the hyperbolic manifold 
H”+! with n the number of forward rates. Furthermore, we show that the “freezing” 
argument is no longer valid when we try to price a swaption in/out the money. 


2 Libor Market Models 


We denote by Fy(t) = F(t, Ty 1, Tk) with the forward rate resetting at Ту with 
Tk = Тұ — Ty the tenor. As the product of the bond P(t, Tk) with the forward 
rates Fi (t) is a difference of two bonds with maturity Т; and Ту, s UP Tk—1)— 


P(t, T,)), and therefore a traded asset, Fx is a (local) martingale under QF, the 
(forward) measure associated with the numéraire P(t, Tk). Therefore, we assume 
the following driftless dynamics 


dFy(t) = ок(ї)Фк(а, Fe)dWe , Vt < Тр, K=1,...,n 
dWydW; = py(t)dt 


with the initial conditions a(t = 0) = о and F(t = 0) = FQ . Throughout this 
paper, W denotes a Brownian under the forward measure. 

In order to achieve some flexibility, we assume that the (normal) local volatility 
Фк (а, Fx) depends on a hidden Markov process a (to be specified later) representing 
a stochastic volatility. We therefore assume that all the forward rates are coupled 
with the same stochastic volatility a. Table 1 presents a list of the different functional 
forms for Ф used in the literature. The BGM, (limited) CEV and shifted log-normal 
models correspond to local volatility models (a — 1) and the others to stochastic 
volatility models with a unique stochastic volatility a driven by a Heston process. 


Table 1 Examples of stochastic (or local) volatility Libor models 


Libor market model SDE 

BGM [6] аЕ = ox (t) Fed Wk 

CEV [1] dF = oy (t) Ff dW 

Limited CEV d Fy = ox(r) Fe min(F |, є? )dW, with e a small 
positive number 

Shifted log-normal dX, = ay (t) XxdWy with Fy = Xy + Qk 

FL-SV [2] d Fy = oy (t) (Bk Fk + (1 — Bx) FD) / vd Wy 
dv = A(v — А)а + v/vdZ; aWydZ = 0 

FL-TSS [16] dF; = ok (X (t) Fe + (1 — BG) Е0) vaW, 
dv = №№ — At + v/vdZ; aWydZ = 0 
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Note that the stochastic differential equation for the Libors rate Р has been written 
in the forward measure Q* and the stochastic equation for a remains the same in the 
forward or forward swap rate measures as a is assumed to be uncorrelated with the 
Libor rates. This will not be the case in our LMM. 


3 Asymptotic Swaption Smile 


We note sag the forward swap rate starting at Tą and expiring at Тз. The forward 
swap rate satisfies the following driftless dynamics in the forward-swap measure 


( ; gs 8 
Q^ (associated to the numéraire Cag(t) = 2 soul 7; P (t, T;)) 


SaB 
дЕ; 


dsag = ср (t) Ф, (а, Fy)dZy 


k=a+l 


Throughout this paper, W denotes a Brownian under the forward measure. 


The local volatility associated to the forward swap rate (dsyg = pu (t, Sag)dB) 
is then by definition 


B 
т " _ Os B Өзү B 
(с 20, К) = Eo? | >, pij (tci (oj (1) Pj (а, Fi) (а, Fj) a ај РЕ J 


ї,ј=а+1 OF; OFj 
: дѕав OSa 
3 Jig Фа, F)9 jla, Ер) Gg Dog? pa П, ав) 
= pij (toj (to; (t) i 
i, j=a+1 a fg раа, 4 Fi) 
(3.1) 
with the submanifold B = (((F;];,a)|sag = K} and p = p(F;,a, ПЕ, dosis 


conditional probability satisfying the (backward) Kolmogorov equation associated 
to the SDE for the Libors and the volatility a in the forward swap measure Q^?. An 
asymptotic expression in the short time limit for the local volatility соз (5, t) can 
be found in two steps: First by finding an asymptotic expansion for the conditional 
probability p (in 0%) and then by doing the integration over В. The first step 
is achieved via the heat kernel expansion technique summarized in Appendix A. 
and the second step using the Laplace saddle-point method explained briefly in 
Appendix B. 
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3.1 Saddle-Point 


0,2 
As the conditional probability at the zero-order is proportional to gw (see 
Appendix A) with d(x,x°) the geodesic distance between the points х = 
({Fi}i=t,....n,@) € R'*! and x? = UE o. ox а), the saddle-point corresponds 
to the point x on the submanifold sag = К which minimizes the geodesic distance 
d(x, x9) [4, 5] 


ЧЕ}, a*) = argminy py 44, [d x, x9] (3.2) 
Introducing a Lagrange multiplier А, this is equivalent to 


(57), a*) = argmin, р.а [d (x, x9)? + A(sag(F) — K)] (3.3) 


3.2 Asymptotic Local Volatility 


Plugging our asymptotic expression for the conditional probability (5.2) into (3.1) 
and doing the integration over IB using the Laplace method (see Appendix B), we 
finally obtain the local volatility at the first-order 


B 
OPE K = У) OGOGO fiy, a*) | 


i,j=atl 


n+l n+l 
Oy fi; СЕ, а*) | By (F*, a*) Ouv fij CF", а") 
m HJij L 76 2 TUVANA ae 
1+2 УА | fj Fea) ( V(F*, a*) 2. Aig fij (F*, a*) 


[n 7,d=1 


with fij(F,a) = Ф: (а, Е)Ф (а, Fj) ee ae v(F,a) = „АР and А = 
[Oud 21-1. g, P and A are defined in Appendix A and computed explicitly in Sect. 4. 
Note that as opposed to other asymptotic methods presented in the literature, this 
formula is exact at t — 0. A similar zero-order formula (independent of the time t 
for c; (t), pij (t) constant) was derived for a general multi-dimensional local volatility 
model by [4]. Moreover, in the expansion, we assumed that the time 7 is small but 
we have made no assumption that Fx is close to the spot Libor or that the volatility 
of volatility is small. 


76 P. Henry-Labordére 


3.3 Asymptotic Smile 


The asymptotic smile can be derived in two steps from the asymptotic local volatility: 
first, we have (50 = Sag(t = 0)) 


Mine Sog) c 
"0c 


dsag = (t, ѕо)аВ,; 


eL а, 50) 


and doing a change of local time t’ = h a (u, ѕо)2а и, We now obtain the associated 
local volatility model for the swap rate 


-aß 
аѕав = бус (Ё, So) dB, 


B 
B MR) 
with oj ^(t,s) = oe 
loc (f^ 5) av? (7,50) 


correspondence between this local volatility and the smile [12] given at the first- 
order by 


. Secondly we know that there is a one-to-one 


aBy2 
aß (op) (U, sodu ag 
овз(К, Ta) = (E = ов$(К)о 
: Е o2 aß 
х{1+ P] (Cioc) U, so)duag (К) (3.4) 
0 
In(£) 
oa (K)o = = ye Aa 
so C(x) 
essi = „(е9 <x) 1 Dope fao. 0) 
BS 2 
K d C(K) 2 C(fav) 
(Io с) (of) (0, 50) fav 
ы aß 0,K 
loc ew 


4 SABR-LMM Model 


We have seen that the asymptotic local and implied volatilities can be computed if we 
know the geodesic distance and a parametrization of geodesic curves on М"+!. This 
is the case for the hyperbolic space IH" for all n. This manifold has a lot of important 
properties. In the first part, we present our BGM-LMM-SABR model and show 
that the underlying geometry is H+! (with n the number of forward Libor rates). 
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Using this connection, we will find an asymptotic local volatility and an asymptotic 
swaption implied volatility. 


4.1 Dynamics 


We introduce the SABR-LMM model, given by the following SDE under the spot 


Libor measure Q (associated to the numéraire Bm (t) = n ug + Trj Fj (Tj-1)) 


j=l 
P(t, Ta(j—1) where B(t) = m if Tm-2 < t < Tm-1) 
dF, = a? By(t, F)dt + og (aC (Fed Zk 
da = vadZn41, dZidZj = pij(t)dt i, j =1,...,n+1 
with 


CF) = oe FP 


TN Y туркду@)о (CI CIC) 
ј=80) re 
Z is a correlated Brownian motion under the measure Q. Here we take the local 
volatility term Фу (a, Fx) of type ad F rr with фк € К. 

The functions C; (Ft) have been scaled by Фф and therefore we can impose that 
24 (0) = 0. The stochastic equation for a was written in the spot Libor measure in 
order to get a SDE independent of a specific underlying swap sag or a forward bond. 
Under the forward swap measure 0%, we have 


аЕ = a°bg(t, F)dt + ox (t)aCy (Fad Zp 
da = —V?a?b«(t, F)dt + vadZy41, dZidZj = pij(t)dt i, j =1,...,.n+1 


with 


P(t, Tj) max(k, j) 


Cap) , miner, tD 


Ti Pki Fi (t) OK (C) Ci (Fi) Ck CF) 
(1+ mri Fi) 


Ј=а+1 
B 
CaO = >, n PT) 
i=at+l 
B i 
Tk Ck (Pk) ркаск(ї) 
b(F,t) 2 У nw) У — 
i=a+l k=G(t) l+ n Fi) 
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i 1 
| ITi-ao UFFO 


and with w; (t) = . Here Z is a correlated Brownian motion 


B j 
Уа Пв (FA) 
under the measure 0%. Note that the forward-rate dynamics under the forward mea- 
sure Q* is much simpler and given by the following stochastic differential equations 
(SDE) 


dF,() = ок@аСк(Е)4%/к , ААМ, = prp (t)dt 


As it is the case for the BGM model, we can use a piecewise parametric form or a 
functional form for the serial volatilities о; (t) and the correlation pj; (1) (here full 
rank) as 


oi(t) = Nia (Ti — 0) + de PF + c] vr < Ti 
pij(t 0) = pp + (1 — pL)e Ав min[T; 1,7; 1 D|Ti-1—T;j-il 


The constants Nj are fixed such as c;(0) = 1. The model depends on 9 + 3n 
parameters (see Table2) which are calibrated on the swaption matrix. In the next 
subsection, we derive the metric, the geodesic distance and the Abelian connection 
underlying this model. 


4.2 Hyperbolic Geometry 


By definition (see Eq. (5.1a)), the infinitesimal distance (at t = 0) between the point 
ха and x, J- dx, (5.12) is given by (9 = [071], (i, j) = (4, ..., n), рі = [07 ia 
and p^" = [ B bas are the components of the inverse of the correlation matrix p) 


п+1 
а? = У, gagdxadxg 
а, ß=1 
2 c ag vdF; vd Fj 2 ja VdFi 
=. J J +2 а ! dat аа da? 
и?а? de Ci(Fi) Су (ЕЈ) 2? CGD 


Table 2 SABR-LMM: 9 + 3n parameters 
BGM parameters a, b,c, d, фі, PL, OA, OB 


CEV parameters Bi; i20, n 


SABR parameters OU; ай = 1.4 п 
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After some algebraic manipulations, we show that in the new coordinates [хк ]k=1...n+1 
(L is the Cholesky decomposition of the (reduced) correlation matrix: [p]j,;=1..n = 


[EET] 1л) 


n NT E gr! no. 
Xk = vik | L + рда, k= l,...,n 
2, ro Ci(F]) È | 


п 
od cd 
Xn = (p^ — > p" p^ pij)?a 
hj 


the metric becomes 


B 2 (p^ = pe pt ptt pij) >, ах? + йа 
m 2 2 
Ха+1 


ds? 


V 


Written in the coordinates [x;], the metric is therefore the standard hyperbolic 
2(gf4 — n s ia pja pj; ү 

(0 Bi p pij) This factor 
can be eliminated by doing a change of time t’ = р 17. In order to compute our 


saddle-point (3.3), we need the geodesic distance which is given in [18]. 


metric on H”*+! modulo a constant factor и = 


Proposition 4.2.1 The geodesic distance d(x, x") on Н"! is given by 


Yu - T 


(4.3) 
РЕ 


d(x, x?) = cosh^! [ d 


Using the geodesic distance on H”+! between the points x = ({ Е), a) and the 


. ITE . Fi dF! . 
initial point x9 = ({F°};, а) (qi = Sro Gc») given by 


i 


d(F, a|F?, а) = cosh! | 14 v^ Xi i-i 00414) + 2v(a — о) Xj- рі} + (a – oy p 
i i р | 200% — У з p" p^ pij aa 


In this model, the non-linear equation (3.3), satisfied by the saddle-point a* (s), q; (s) 
which implicitly depends on the swaption strike s, read: 


n n 
a* (s)? p^ = œ? p“ — 2va > 047 +17 > p!qiqi (4.4) 
i= ij=1 


(рѓа (ate) o) tX p! qt)d(q*, a*) Е X Əsas 
a*(s)(cosh(d(a*, (q*))?? — 1) a да 


\(a,q)=(a*,q*) (4.5) 
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with 


Fr 
qi = фу! x Pdx 
FD 


The saddle-point is determined by solving these non-linear equations (4.4) and (4.5) 
and an approximation (which could be used as a guess solution in a numerical opti- 
mization routine) is found by linearizing these equations around the spot Libor rates 
(i.e. qi — 0) 


X G)- КӨ м Р (4.6) 
Dae W pUq D pq 
F* Xa Agwj(S — so) 
т ULL 4 0(G- sy) (4.7) 
Е; >41 YpYgP pq 


with w; = ^us (qi = 0) and рі = gj — £72. Note that when the strike is close to 


at-the-money, the saddle-points are close to the spot Libors and a* = a. Moreover, 
by using the explicit expression for the hyperbolic distance, the Van-Vleck-Morette 
determinant is 


d(F, a|F9, a) 


A(F, a|F°, а) = 
Vcosh? (d (F, a| F9, a)) — 1 


4.3 Connection 


The Abelian connection is given by (5.1b) 


| 1 ш i bj(t, F) 290) ia 
A= aay 2^ Cor 2 _ 


1 (ue 9;Cj(Fj) 


Aa 
Cj(Fj) 2 


) ир“. (Е, t) 


N 


j=l 
where we have used that 


2" det[p]-? 


„8 = vat! Па С (Е) 


Unifying the BGM and SABR Models ... 81 


Finally, the Abelian 1-form connection, A = У? , Aid F; + Aada, is 


1x (bjt, F) Oj;Cj(Fj) E n | 
= : JT. idg; + gf) 
A XC 2 ) „Ўр dqi + p” da 
j=! E i=l 
n . 
— ba (t, Е) бэ p'* dqi + paa) 


i=l 


In order to compute the log of the parallel gauge transport In(P) (a, q|a) = Je A 
we need to know a parametrization of the geodesic curve on H”+!, However, we 
can directly find In(P) (a, q|o) if we approximate the drifts ру, (7, F) by their values 
at the Libor spots (and т = 0). A similar approximation was done in the Hagan-al 
formula [11] as was shown in [12]. Modulo this approximation, 


14 [f 50, Fù) | 8;C;09) 
In(P)(a, qla) ~ — >, ( m J (xs + p (a — | 
j=l "SE 


i=l 


— ba(0, F9) ( Урі“ + p" (a – 2 


i=l 


4.4 Asymptotic Smile—Summary 
The asymptotic local volatility is given by (3.4) 


B 
o. 2 


(сш) 0,5) = У pioi(Do;() fij (a, F) 


i,j=a+l 


8 T * - * * 
x ( +2 У ҮТ pru ) q 2 fu (Ps a ) WF x, a*) 


ceu fij (F*, a*) Sij(F*,a*) CF x, a*) 


п+1 


- Ў. андып sain 


7,0=1 1J 
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with (a* = a*(s), F* = (F7(s));) the saddle-point satisfying the Eqs. (4.4) and 
(4.5) approximated by (4.6) and (4.7) and 


Sag Wap 3 D eT 
fija. F) = a "Ci DC; E) BE ЭР; ‚ Vla, F) = /gAP, A% = [9,342] 


2 107419) + 2v(a — 0) 5. ., p/^qj + (a — a)? 
2008 — Уу? j= ріарЈаріз)ао 


ge b;(0, F9 дус} (F9) 
In(P)(a, q|o) ~ т >, ( a 5 2 +p а) (а = 2] 
с jV 


2 
V 
d(F,a|F9,o) = cosh”! I + 


j=l i=l 
Ме . 
— ba(0, F9) | v У o" a; + p"* (a – о) 
i=l 
d(F, al F®, 
A(F, a|F°, о) = a) 
cosh? (d(F, 000) = 1 
aH 
2% dag]? 
vE = 


val+n ру Ci (Е) 


Note that this expression is exact when t goes to zero. The smile at the first- 
order is then obtained by plugging the above expression into (3.4) with г = 
2 
A 
"n i Н t. 
2(p*a —5 7j kat pl^ pl pij) 


Remark 4.4.1 (Libor СЕУ model) Note that our model reduces for v goes to zero 
(and а = 1) to the Andersen-Andreasen CEV Libor model (with different СЕУ 
parameters for each Libors) and the above expressions degenerates into 


ij) = c cus) 2888 2 


n 
d(F)= |2 У рй) 


i,j=1 
b;(0, F°) un 


1 Ј lg 
n(P)(q) = a CE) Ee 
A(F, Е) = 1 

A= 23 det[p]-? 


Ii- Cif) 
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with the saddle-points (4.4) and (4.5) satisfying the non-linear equations (modulo 
the constraint sag = 5) 


Ӧдзав 
дд: qt 


pig; = 


4.5 Comments and Numerical Tests 


It is interesting to note that for n = 1, i.e. for a caplet, the caplet asymptotic smile 
reduces to the classical SABR formula by construction. Moreover, the asymptotic 
local volatility is given at the zero-order by 


дѕав (F*) Sap 


F* 
an p? 


PED = У pi OO у(та*?(Е*)С; (FC СЕЎ) 
i j=l 


with F* depending implicitly on s via (4.4) and (4.5). At this stage, it is useful 
to recall how a similar asymptotic local volatility is derived using the “freezing” 
argument. The forward swap rate satisfies the following SDE in the forward swap 
numéraire Q^ 


n 
дзав 
dsqg = — ay (t)aCy(Fy)d Z 4.8 
ов 2 (ас, (Fi) Zi (4.8) 
К=1 
The "freezing" argument consists in assuming that the terms Tal and es are 


almost constant. Therefore, the SDE (4.8) can be approximated by 


Os 


n 
a8 (r0 
аав = X ze (Foi) 
jai № 


Ск (ЕР) 
C, (59) 


Ск (s)d Zk 


and the local volatility is 


Ci (FD) Cj (EF?) sag 
CH) Crs F; 


бү ope У, pij (toi (0) оз (да? (s) 
i,j-l 

x (Ро) 2828 (poc. (sc ss) 

OF; : J 
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Table 3 Scenario A: Libor volatility A; (t) = 5 96 


P. Henry-Labordére 


Swaption Strike МС (%) Fl F2 

5x15 4% (ITM) 22.42 22.41 % (—1) 22.61 % (19) 
6% (АТМ) 20.33 20.41 % (8) 20.46 % (13) 
8 % (OTM) 18.92 18.93 % (1) 19.01 % (10) 

10 x 10 4% (ITM) 22.41 22.51% (11) 22.67 % (26) 
6 % (ATM) 20.38 20.41 % (3) 20.50 % (12) 
8 % (OTM) 18.93 18.93 % (—1) 19.05 96 (12) 


Libor L;(0) = 6%. 8 = 0.5 


Table 4 Scenario B: Libor volatility dL; = 0.25(0.17 + 0.002(7;—1 — 7)L?aW 


Swaption Strike МС (%) Fl F2 

5x15 5.08 % (ITM) 18.12 18.20 % (8) 18.17 96 (5) 
7.26 96 (ATM) 16.51 16.61 % (10) 16.63 % (12) 
9.44 96 (OTM) 15.38 15.38 % (0) 15.56 % (18) 

10 x 10 5.55 96 (ITM) 17.80 17.81 9? (1) 17.89 96 (9) 
7.93 96 (АТМ) 16.26 16.33 % (7) 16.38 96 (11) 
10.31 % (OTM) 15.17 15.19 % (2) 15.32% (15) 


Libor L;(0) = log(a + bi), Lo(0) = 5 96, L19(0) = 9%. 8 = 0.5 


We can reproduce this formula for the swaption smile at-the-money! as the 
saddle-point Libor rates coincides with the spot rates. This is not the case for in/out- 
the-money swaption. Therefore our expression (exact at the zero-order) shows that 
the freezing argument is no longer correct when we try to fit a swaption implied smile 
in/out-the-money. In the following, we have tested our asymptotic swaption formula 
at the zero-order with the same beta д; = б and v = 0 (Formula F1) against the 
Andersen-Andreasen asymptotic formula (Formula F2) [1] in the case v = 0. The 
accuracy of these approximations are examined using Monte-Carlo (MC) prices as a 
benchmark. Following [15], we consider five scenarii (see Tables3, 4, 5, 6 and 7). In 
the following tables, the implied volatility is reported and the numbers in brackets are 
the errors (in basis points i.e. true volatility times 10^) corresponding to the implied 
volatility computed using the F1 or F2 formula minus the MC implied volatility. An 
x x y swaption has an option maturity of x years, a swap length of y years and a 
tenor of one year. We set a time-step for Monte-Carlo б = 0.125 and 2!6 paths.” Our 
formula F1 is more accurate than F2. 


lAn at-the-money swaption (ATM) has a strike K equal to the spot rate sag (0) and an out-of-the 
money (OTM) (resp. in-the-money (ITM)) swaption has К < $455(0) (resp. К > Sq(0)). 
?We have used a predictor-corrector scheme with a Brownian bridge. 
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Table 5 Scenario C: Libor volatility dL; = 0.25(0.17 — 0.002(7;—1 — t) Le aw 


Swaption Strike MC (%) Fl F2 

5x15 5.08 % (ITM) 14.89 14.97 96 (8) 15.08 % (19) 
7.26 96 (ATM) 13.73 13.79 % (4) 13.81 % (8) 
9.44 96 (OTM) 12.92 12.91% (—1) 12.92 96 (0) 

10 x 10 5.55 96 (ITM) 14.52 14.53 % (1) 14.64 96 (12) 
7.93 96 (ATM) 13.33 13.38 96 (5) 13.40 % (7) 
10.31 % (OTM) |12.51 12.51% (0) 12.5496 (3) 


Libor L;(0) = log(a + bi). Lo(0) = 5 96, L19(0) = 9%. 8 = 0.5 


Table 6 Scenario D: dL; — 00514 Cr Wi | Jmm. b(t) = 
pe kiii) 4 ge-ka(Ti-170. р (г) = М1 = gie atii 
Swaption Strike MC (96) Е1 F2 
5x15 5.08 96 (ITM) 19.19 19.33 % (14) 19.38 % (19) 
726% (АТМ)  |17.59 17.72 % (13) 17.75 % (16) 
9.44 96 (OTM) | 16.46 16.49 % (3) 16.61 % (15) 
10 x 10 5.55 % (ITM) 18.92 18.94 96 (2) 19.06 96 (14) 
7.93 % (АТМ) 117.31 17.39% (8) 17.45 % (14) 
10.31 % (OTM) |16.18 16.21 96 (3) 16.32 % (14) 
p = 0.99, 0 = —0.99, kı = Кә = 0.54. Libor L;(0) = log(a + bi). Lo(0) = 5%, L19(0) = 9 96. 
8 — 0.5 


Table 7 Scenario E: Scenario D with 9 = 0.3 


Swaption Strike MC (96) Fl F2 

5x15 5.08 % (ITM) 33.09 33.65 % (56) 34.24 % (115) 
7.26 % (ATM) 29.47 29.96 % (49) 30.23 96 (76) 
9.44 96 (OTM) 26.92 27.14% (22) 27.49 % (57) 

10 x 10 5.55 96 (ITM) 31.75 32.41 % (66) 33.33 96 (158) 
7.93 96 (ITM) 28.47 28.88 96 (41) 29.37 96 (91) 
10.31 % (OTM) | 26.01 26.18 % (17) 26.68 % (67) 


5 Conclusion 


In this short note, we have introduced a LMM model coupled to a SABR stochastic 
volatility process. By using the heat kernel expansion technique in the short time 
limit, we have obtained an asymptotic swaption implied volatility at the first-order, 
compatible with the Hagan-al classical formula for caplets. Moreover, we have seen 
that this exact expression (when the expiry is very short) is incompatible with the 
analogous expression obtained using the freezing argument. 
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Appendix А: Heat Kernel Expansion 


An short-time expansion of the conditional probability for a multi-dimensional Itó 
diffusion process can be achieved using the heat kernel expansion. In that purpose, the 
Kolmogorov equation is rewritten as the heat kernel equation on a (7)-dimensional 
Riemannian manifold M” endowed with an Abelian connection as explained in 
[12, 13]. Let's assume that our multi-dimensional stochastic equations (in Q28) are 
written as 


dx, = b,(t, x)dt + o, (t, x)dW, 


with dW,,dW, = py,(t)dt. Then, the metric g,,, depends only on the diffusion terms 
c,, and the connection Ay on the drift terms b, as well 


p" (t) 
e(t, x)ow(t, x) 


n n+l 
Aalt, x) = (нен = У`в720, (ите), а = 1,...,п 


pel и=1 


dux) a sie dios dis pP” = Te Ie (5.1a) 


(5.1b) 


with g(t, x) = децо, (t, x)]. In terms of these functions, the asymptotic solution to 
the Kolmogorov equation in the short-time limit is given by 


px, tx?) = 


v B(x) ‘N(x, x9)P(x, х0)е n ( + Уа (x, и) 10 
(471) 2 


п=1 


(5.2) 
e Here, c (x, x?) is ће Synge function defined as 


d? : 0 
c(x, x9) = d(x, x) 
2 
The distance d(x, х0) is defined as the minimizer of 


dx, (t) dx,(t) 
dt dt 


T 
d(x, x°)? = min | 20 = 0, x) dt 
0 


and f parameterizes the curve C(x, х9) joining x(t = 0) = x9 and x(T) = х. 
e A(x, х0) is the so-called Van Vleck-Morette determinant 


O26 (x , x9) 


a a,0 080.2573 (5.3) 


A(x, х0) = g(0, х)? det(— 
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with g(x) = det[g,, (0, x)] 


e P(x, x?) is the parallel transport of the Abelian connection along the geodesic 


C(x, х0) from the point x to хо. 


Р(х, x?) = e7 со xy Au 70,204" (5.4) 


e The а; (х, x?) are smooth functions on M” and depend on geometric invariants 
such as the scalar curvature R. More details can be found in [12]. 


Appendix B: Saddle-Point Method 


The integration over В is obtained by using a saddle-point method which consists in 
approximating at the first order the integral f f (x)et? dx in the limit є large by [8] 


1 дез f 
є (- 2f 


даР 1 
+ (^а + e) Aog As 


даву $05, 0 
== ay Aap Aye Аш 


f 1a “~e>>l faeta (1 s Aag 


with Ао? = [9559], dx = IH. dxi and x* the saddle-point (which minimizes 
ф(х)). Here we have used Einstein summation convention. This expression can be 
obtained by developing ф(х) and f(x) in series around x*. The quadratic part in 
ф(х) leads to a Gaussian integration over x which can be performed. 
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Second Order Expansion for Implied 
Volatility in Two Factor Local Stochastic 
Volatility Models and Applications 

to the Dynamic A-Sabr Model 


Gérard Ben Arous and Peter Laurence 


Abstract Using an expansion of the transition density function of a two dimensional 
time inhomogeneous diffusion, we obtain the first and second order terms in the short 
time asymptotics of the local volatility function in a family of time inhomogeneous 
local-stochastic volatility models. With the local volatility function at our disposal, 
we show how recent results (Gatheral et al., Math. Financ. 22:591—620, 2012, [28]) 
for one dimensional diffusions can be applied to also determine expansions for call 
prices as well as for the implied volatility. The results are worked out in detail in 
the case of the dynamic Sabr model, thus generalizing earlier work by Hagan et al. 
(Wilmott Mag. 84—108, 2003, [31]), Hagan and Lesniewski (Springer Proceedings in 
Mathematics and Statistics, vol. 110, 2015, [32]) and by Henry-Labordére (Springer 
Proceedings in Mathematics and Statistics, vol. 110, 2015, Geometry, and Modeling 
in Finance. Chapman & Hall/CRC Financial Mathematics Series, 2008, [39, 40]). 


Keywords Implied volatility - Local volatility * Asymptotic expansion * Heat 
kernels 


1 Introduction 


Stochastic volatility models offer a widely accepted approach to incorporating into 
the modeling of option markets a flexibility that accounts for the implied volatility 
smile or skew. From a historical perspective he first models to be introduced into the 
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literature were the Hull and White model [37], the Stein and Stein model [50] and 
the Heston model [33]. In three of these models the underlying asset and its volatility 
are driven by Brownian motions that may or may not be instantaneously correlated. 
The correlation coefficient is taken to be a constant p. Then Bates introduced the 
first of a series of models incorporating jumps [7]. These were followed by work by 
Andersen and Andreasen [2]. 

In recent times there has been an explosion of models using the method of sto- 
chastic time changes to produce ever more versatile models [15, 16]. However purely 
diffusive models have retained of their popularity. A case in point is the introduc- 
tion into the literature of the dynamic SABR (stochastic alpha-beta-rho model), by 
Hagan, Lesniewski and Woodward [32] 


dF, = YO) F yd Wy 

dy, = v(t)yid War 

< ат, а, >= p(t)dt 
Fo = Fo; yo = а 


A possible shortcoming of the Sabr was the lack of mean reversion in the stochastic 
volatility. To address this a generalization was proposes by Henry-Labordére, who 
in [39] introduced the à Sabr model. In this new model the second equation is 
complemented by a mean-reverting term 


dyr = k(0 = yt dt + yid War 


The incorporation of a mean-reverting volatility was well received in the market 
place, since most practitioners think this feature is inherent in the volatility’s evolu- 
tion. 

In this paper we will consider a model which includes both the dynamic Sabr 
model and the A-Sabr models, in that we allow the volatility to be mean reverting 
and in addition for all parameters to depend explicitly on time. We explain how the 
approach via Laplace asymptotics needs to be adjusted in order to allow for this time 
dependence. The family of models considered also encompasses the so-called local 
volatility Heston models. 

Two major tools have been developed to evaluate European option prices in sto- 
chastic volatility models in closed form. The first is the so-called mixing formula, 
due to Hull and White [37] and Renault and Touzi [49]. This method is particularly 
powerful when applied to uncorrelated stochastic volatility models, since in this case, 
the price of the European option is fully determined, provided the law of the time 
integral of the instantaneous variance is known. In the language of stochastic time 
changes, we need to determine the law of the time change. The second tool, and the 
one closest to the approach taken in this paper, was introduced into mathematical 
finance by Kunitomo and Takahashi [38]. It is the method of asymptotic expansions. 
Hagan and Woodward [30] applied this method to find asymptotic expansions for 
the implied volatility of European options in a local volatility setting. Hagan, Kumar, 
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Lesniewski and Woodward [32] used asymptotics methods to obtain approximations 
for the implied volatility in the two factor Sabr models. In a Courant Institute lecture 
André Lesniewski [43] introduced the geometric approach to asymptotics, by relat- 
ing the underlying geometry of diffusion associated to the SABR model in the case 
B = Oto the Poincaré plane, a model of hyperbolic space, and outlined an approach to 
the asymptotics in stochastic volatility models via a WKB expansion. This approach 
was further developed in the important unpublished working paper [31] by Hagan, 
Lesniewski and Woodward. These authors used changes of variables to reduce the 
Sabr model with 6 Z 0 to a perturbed form of the Sabr model with 6 = 0 and 
then used the Hausdorff-Baker Campbell formula to find approximate solutions for 
the fundamental solution of the perturbed problem. In [39] Henry-Labordére's made 
contributions of both a theoretical and a practical nature. On the theoretical side 
he showed how an approach to small time asymptotics of fundamental solutions 
can be derived via the heat kernel expansion. This expansion was introduced by 
Minakshisundaram and Pleijel in [44], building on earlier work by Hadamard [29]. 
Using the heat kernel methods a fundamental solution for linear parabolic differential 
operators with variable coefficients can be obtained even in cases where the operator 
contains first order (i.e. drift) and zero order terms. On the practical side, as men- 
tioned earlier, Henry-Labordére introduced the A.-Sabr model, and showed how the 
heat kernel method yields asymptotic formulas for the fundamental solution and for 
the implied volatility and local volatility in this model. Also, in recent work, Forde 
and Jacquier [24, 25], have begun the rigorous investigation of implied volatility in 
both the short and long time limit, in the Heston model. In particular they obtained the 
first closed form near the money expression for the implied volatility in the Heston 
setting. This work was further extended in joint work with Mijatovié and with Lee 
[26, 27]. A different, i.e. fast mean reverting, regime was investigated in [23]. Also, 
Takahashi and collaborators have recently applied their Malliavin and Wiener-Itó 
iterated integral based approach to the A-Sabr model [51], and have demonstrated 
its effectiveness and versatility. Benhamou and Croissant [8] examine the concept 
of local time in the Sabr model and show that this valuation works using a Black- 
Scholes like formula in which complex quantities appear. Bourgade and Croissant 
[13], following an approach by Molchanov [47], in a working paper, applied the 
latter to a generalized p-homogeneous version of the Sabr model [13]. 

In a recent paper Gatheral, Hsu, Laurence, Ouyang and Wang [28] have re- 
considered the implied volatility expansions in local volatility models and developed 
highly accurate expansions up to the second order for the implied volatility. They 
have also pointed out the need to consider separate "regimes" in developing such 
expansions. A “near the money” small time limit regime was considered in an influ- 
ential paper by Megvedev and Scaillet [45] based on the nonlinear PDE approach 
of Berestycki et al. [11], extended to a class of jump diffusions. Gatheral et al. [28] 
considered two complimentary regimes, an “away from the money regime" and an 
at ће money regime. In the former, the limit is taken as the time to maturity т goes to 
Zero, for fixed value of the spot (or of the forward) and for fixed strike. This regime 
is associated to exponential decay of the difference between call prices and their 
intrinsic value as т — 0. On the practical side, the main contribution of [28] was 
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extending earlier asymptotic results for local volatility models to second order, so 
that they become capable to furnish highly accurate expansions both for time homo- 
geneous and fime inhomogeneous models. In addition, unlike earlier treatments (with 
the exception of Forde and Jacquier's treatment of the Heston model) the expansions 
in [28] were put on a rigorous mathematical basis. In particular it is shown that the 
terms in the expansion of the implied volatility can actually be interpreted as deriv- 
atives with respect to time to maturity, and for this reason, are in a certain sense 
optimal. 

In Sect. 4.2 we combine the results in [28] with the Gyóngy projection tech- 
nique and with the heat kernel method for time inhomogeneous diffusions to develop 
asymptotic expansions that are highly accurate and we show, using in part the results 
in [28] that these results extend to stochastic volatility models with time dependent 
parameters. 'That is to say, we describe in detail the first and second term in the 
expansion of the implied volatility in the limit of short times to expiration in two 
factor local-stochastic volatility models. The form of the resulting expansion is: 


e Second order expansion of local volatility o; ( f;, at, f, Т): 


oLl fiant, fiT) eol (fiant, f) ar dst for 
+o (fant, Pe Lae) r0 (11) 


where we have set t = T — t and where, in the time inhomogeneous case, we note 
that the coefficients о{?, i = 0, 1, 2, will depend explicitly on the spot time f, as 
well. 


e Order т?/? expansion of the call prices 


[CG. t, K, T) - (s — K)*] e" (1.2) 
= CU, K, N1? + CO (s, Kor +00”), 


e Second order expansion of implied volatility овѕ (/;, ar, f, Т); 


Ops (Cft. ar, t, /, Т) 
= of) (fr, as, t, f) + оу (fos ao ts Ут + оўу (О, а, t, ft? + О(т?) 
(1.3) 


The key to the proof will be determining the coefficients in the first expansion (for 
local volatility). With the local volatility up to second order in hand, we can apply 
the asymptotic expansion obtained for time inhomogeneous local volatility models 
in [28] to derive the implied volatility. One difference between this paper and earlier 
ones is that we do not make simplifications for the purpose of making formulas shorter 
or simpler. We take the point of view that the derivation of the full length formulas 
should be made clear and these should be presented at a first stage. At a second stage, 
one can explore ways to simplify or shorten the formulas. Since the formulas are all in 
closed form, up to quadrature, this requires at most approximating a one dimensional 
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integral by Simpson's rule and thus, even if the formulas are long their calculation 
is instantaneous. An illustration is given for the A-Sabr model. We devote a section 
to the asymptotics obtained in [28] for the one factor (local volatility) models, since 
these can be readily applied in the present setting once the local volatility function 
has been determined. 

As this work was nearing completion we learned of nice independent work by 
Louis Paulot [48] who also derives a second order asymptotic expansion for the 
implied volatility in two factor stochastic volatility models in the time homogeneous 
case. Paulot does not consider the time inhomogeneous case considered in this paper. 
Paulot takes a somewhat different (and very reasonable) approach to ours, in which 
he bypasses the computation of the local volatility function. For the local variance 
function (square of the local volatility), which he does not try to determine explicitly, 
his calculations stop at the first order (4.23) and do not seek to determine the next 
order (4.24). The determination of the /ocal volatility in stochastic volatility models 
is of independent interest and is provided by our method. Once the local volatility 
has been determined the determination of the call prices is a straightforward "plug 
and play", using the Proposition (5.2). Alternatively using the local volatility one can 
use Dupire's forward equation to obtain the call prices for all strikes and maturities. 
The determination of the implied volatility is more involved, since the relationship 
between local volatility and implied volatility derived in [28] and recalled in (5.3) is 
correspondingly more complicated. 

Another important issue concerns rigor. Two questions are often asked concerning 
asymptotic expansions used in mathematical finance and also in other areas of the 
applied sciences. The first is, are the results rigorous? The second, related question 
concerns the role of the boundary conditions and the influence of the boundary 
conditions on the asymptotics. In the present paper, in the style of classical asymptotic 
analysis, our main concern is not on full rigor, but rather to provide details of full 
expansions. As opposed to previous works we provide full detail of intermediate 
steps. 

However, in order to point out the rigorous underpinnings of this work, the influ- 
ence of degenerating coefficients, and of boundary conditions, we devote a section to 
the discussion of how (certain aspects) of how our results here can be made rigorous, 
based on work of Azencott and co-workers [6], the relevance of whose work seems 
to have not been noticed in mathematical finance heretofore. In particular, we point 
out, that, the Sabr and A Sabr model are associated to an incomplete Riemannian 
manifold, when f < 1. 


2 The Family of Models Considered and Application 
of the Heat Kernel Expansion 


2.1 An Efficient Approach 


In this section, we introduce the family of models to which our results will apply. 
It is understood that we are working under a risk neutral forward measure and are 
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assuming zero dividends for the asset process 


df, =C(ft, Dard Wi , (2.1) 
da, = a (a, t)dt + v(a, t)dWo, , (2.2) 
< аў, dW > = p(t)dt (2.3) 


with initial conditions at time zero given by (sọ, ao). Here W; апа W, are Brownian 
motions with deterministic, possibly time dependent correlation p(t). 

The associated Kolmogorov backward and forward equations for this family of 
models are given by: 


1 1 

pi + =С?а?руу + pCavp a + 7 V paa + ара = 0 (2.4) 
1 1 

pi— g C pr — (оСаър) ға — 50 P)aa +(ap)a=0 (2.5) 


The matrix a with entries a;j, i, j = 1,2 


C?a? pCav 
а= po aa (2.6) 


is the diffusion matrix. 
The price, at time ¢ of a European call option C(f;, at; K, T) struck at К, under 
the forward measure with maturity T' is given by the expectation 


Cfi. а; K, T) = Ec ar — K)']. 


which can be expressed via the transition density p( f;, ar, t; fr, a, T) in the form 


CU ars t; К, De (XY avt fca arde. 


Here and throughout this paper we will assume the expectations are taken with respect 
to the a (given) risk neutral measure. 


Heat Kernel: Time homogeneous case 


In the time homogeneous case, we may without loss of generality assume the initial 
time is O. It is well-studied in differential geometry and stochastic analysis, under 
certain technical conditions (see the discussion in Sect. 3), the transition density has 
in the time homogeneous case the following expansion as T — 0* 
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_ d? (fo,ag; Ла) 
2T 


D(fo. ао; f. a, T) = Sel, aU (fo, ao; f. a. D= (2.7) 


where, letting xy = (fo, ao) and x = (f, a) 


e 2 is the volume form associated with the Riemannian metric determined by the 
inverse of the diffusion matrix (2.6). The inner product, denoted < .,. >, of two 
vectors A and B is given by 


<А, B > = gijaibj 


where the Einstein summation is used to sum over repeated indices. 

d (xo, х) is the geodesic (Riemannian) distance between xo and x, in the above 
mentioned metric g;; corresponding to the inverse of the diffusion matrix. 

U is the series expansion: 


n 
U (xo; x, T) = У ux(xo; x) T* + O(T"*"). 
k=0 


The uj's are called the heat kernel coefficients. In particular, ио(хо; х) = 


у A(xo, X eh m. where А and V are defined below. 
А is the Van Vleck-DeWitt determinant: 


A (Xo, х) = 


1 4 ( 1 n 
e 3 
g(xo)g(x) 2 дходх 


e P = el ^P is the exponential of the work done by the vector field A along the 
geodesic Y, with V — vd; and 


yi =b — m | vae? | | Q.8) 


Here the bracket “ < -,- >” with vector entries V and W is defined via the metric 
81} аѕ > SijVi wj. 

By adding and subtracting first order terms (2.4) can then be re-expressed in the 
form 


1 
u + yAgu +V: Vu —0, (2.9) 


where Ag is the second order Laplace Beltrami operator z D (VE. gii aa 


Note that the zeroth order heat kernel coefficient is known in closed form provided 
we have available in closed form both the distance function and the geodesics. The 
higher order on diagonal heat kernel coefficients и; (x, x) have been calculated up to 
order 4 in very general settings. On the other hand the efficient calculation of the off 
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diagonal heat kernel coefficients U;, for i > 1, is still an active field of research. We 
refer to Hsu [34] for an in-depth introduction to heat kernel expansion in stochastic 
analysis perspective. 

Given the heat kernel expansion in (2.7) for the transition density p, the call price 
C as T — 0* has the expansion 


n 
1 d2 (x 0) 
C(s0, ao; К, T) ~ Se 7^ Gi, a)T" dsda. 
k=0 {s>K} 


where for notational simplicity we have denoted by Gg the expression 
G&(s, a) = (s — K)ug(s, a)y g(s, a). (2.10) 


Heat kernel expansion: Time inhomogeneous case 
For models of the form (2.1)-(2.3) with time dependent coefficients, the heat kernel 
is modified as follows: again, let хо = (fo, ao) and x = (f, a) and lett = T — t 


_ d? (хо;х) 


p&o, t; x, T) = Luo. t; x, Т), T) (2.11) 
ЛТ 


where now the distance function d, the Van Vleck-De Witt determinant ^ and the 
contravariant drift (2.8), depend explicitly on time because of the explicit dependence 
of the metric and the drift on time. The series expansion H in this case reads 


n 
uxo, t; x, T) = У ио, x, т + OG"). 
к=0 


Moreover, the explicit form of the heat kernel coefficients is given by the following 
formula 


я d(fo,ao, f,a,t) F 7 
"9 е | а.а) BAC FCO), (p), f. а) D 
0 


ot 


and recursively 


ио(хо, x,t) [doo fa.) pk-l 90,1 
ир(хо, X, f) = ———À——— LULA m 
к(хо, х, t) а“ (xo, х, t) Jo Us k-1 3 0 
(2.12) 
where 5(0) = f,a(0) = a. Since the definition of и; always involves uo, it is 


convenient to factor the latter out and to define 
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^ Uj 
úi = — (2.13) 


uo 


Comparison with Henry-Labordére approach 


In [39] Henry-Labordére takes a different, in the time-homogeneous case equiva- 
lent to ours, approach to implementing the heat kernel expansion which involves a 
complex line bundle. Thus Eq. (2.9) is recast in the form 


и +87120; + Aig! g" + (8; + Aju + Qu = 0, (2.14) 
where 
Ai = gij A 
where A/ is given in (2.8), and where 
Q = gil (Aj Aj — bj. A; дА) 


While the form (2.14) is equivalent to our form (2.9) it makes it necessary to 
introduce the potential term Q. The motivations for introducing such a formulation 
are undoubtedly related to the arduous task encountered in quantum field theory 
of determining the higher order heat kernel coefficients. This covariant approach, 
introduced among others by Avramidi in [4] and discussed in his lectures on mathe- 
matical finance [5], introduces heavy machinery, complex line bundles gauge groups 
etc., which for instance in the case of time homogeneous parabolic operators leads 
to the determination of heat kernel coefficients и, via the solution of the transport 
equations: 


1 
(1+ суи = PAT? (D+ О)А!?Ри,_1 
where 


oj = Vio У; = 0; + Aj 


c= glo; 


This covariant approach undoubtedly has computational and conceptual advantages 
when calculating the higher order dimensional heat kernel coefficients. However 
the method explained in the following subsection, due to Yoshida, is we think, much 
simpler and more effective in linking in an intuitive way the coefficients of the original 
parabolic operator to the и;. Moreover it is extremely easy in Yoshida's approach to 
incorporate the influence of time inhomogeneity on the heat kernel coefficients. 
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3 The Starting Point for Rigorous Justification of Heat 
Kernel Method: Varadhan's Lemma 


This section is meant to give the reader an intuitive grasp of some of the key concepts. 
We do not aspire to completeness but try to indicate original sources where various 
issues are dealt with in depth. This section can be skipped by those interested only 
in the practical results. 

Varadhan's lemma, in the form he first derived it (see [52, 53]), applies to the 
case where the underlying operator is uniformly parabolic, with sufficient regularity 
on the coefficients. In its original form as obtained in [52], it relates to the unique 
fundamental solution in all of IR" associated to a diffusion 


p;— Ір = 0 
and finds the small time behavior of such a diffusion to be 
lim —2t log p;(x, y) = d? (x, y) (3.1) 
t—0 


where d is the Riemannian distance introduced in Sect. 2, associated to the principle 
part of the diffusion operator. Varadhan proved (see Theorem 4.1.3) holds uniformly 
for x and y in compact sets, for which d (x, y) remains bounded. When expressed in 
the above form, one has in mind that the domain considered does not have a boundary. 
Varadhan's lemma can be viewed as a weak form of the zero-th order heat kernel 
expansion described in Sect. 2. In fact, a general principle is that in many cases, once 
it is known that Varadhan's lemma holds, the full heat kernel expansion (suitably 
modified by a Levy parametrix, see Sect. 3.4 in [28] readily follows. 

On a domain with a boundary, as is the case encountered in this paper, where the 
domain is R2, or the upper half plane! the situation is more delicate. It is natural that 
this be the case, since to begin with, one on a domain with boundary, one needs to 
deal with the possibility that the diffusion can reach the boundary. In cases where the 
diffusion can reach the boundary, in order to uniquely define the diffusion thereafter 
we can impose an absorbing boundary condition. This amounts to considering the 
transition density for the diffusion that is killed the first time it reaches the boundary 
of the domain. 


рр(х, у, t) 
= Prob. of reaching y at time t 


and not reaching the boundary before time f, starting at x 


! Depending on whether forward or log of forward is taken as state variable. 


Second Order Expansion for Implied Volatility ... 99 


This leads to a general construction of the so-called minimal heat kernel which 
applies to non-compact Riemannian manifolds like R2. In such a construction, which 
is discussed in detail in Hsu [34], the manifold is exhausted by a sequence D, of 
nested compact domains for which the Dirichlet heat kernel p, (diffusion killed on 
exiting Dn), is constructed and the minimal heat kernel corresponds to the pointwise 
limit of these as n — оо. It is intuitively clear that at points on the boundary of the 
manifold where the diffusion can reach the boundary, this construction leads to a 
fundamental solution satisfies a Dirichlet condition in the backward variables. 

As was shown by Azencott [6], a simple sufficient condition for Varadhan's lemma 
to hold that does extend to the case where the coefficients degenerate at the boundary, 
is the case of complete Riemannian manifolds. To put this geometric concept into 
perspective, consider the case where the underlying manifold consists of the first 
quadrant, corresponding to non-negative forward price and volatility. In this case, 
the manifold together with the natural Riemannian metric associated to the inverse of 
the diffusion metric, is complete, if the boundary (and the other points “at infinity", 
if the domain is unbounded) are at an infinite distance from any point in the interior 
of the manifold. It can be shown that a Riemannian manifold is complete if it is 
metrically complete, i.e., all Cauchy sequences converge to a point in M (and not on 
the boundary of M). 

The above, with a modern proof appears as Theorem 5.2.1 in Hsu's book [34]. 


Theorem 3.1 Let M be a complete Riemannian manifold and py (t, x, y) the min- 
imal heat kernel on M. Then, uniformly on compact subsets of M x M, we have 
(3.1). 


When the underlying Riemannian manifold fails to be complete, Hsu [36] refined 
the work by Azencott by showing that a sufficient condition under which Varadhan's 
lemma is still guaranteed to hold is 


d(x,y) < а(х, oo) + абу, оо), (3.2) 


which in our setting amounts to requiring that the distance of x to y does not exceed 
the sum of the distance of x to the boundary and the distance of y to the boundary. 
Below we illustrate this with a couple of examples: 
Consider the well known distance function associated to the family of A-Sabr 
models (7.8) in the ( f, a) plane. 


e When В = 1, the boundary а = 0 is at an infinite distance from any point in 
the interior ( f, a). This is because the quantity q involved in the definition of the 
distance equals 


f | 
| шп | —du| = оо 
e>0 и 


€ 
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Similarly it is easily seen that the f = 0 axis is at an infinite distance from any 
point in the interior. Thus in the case В = 0 the Riemannian manifold is complete 
and Varadhan's lemma holds for all points in the manifold. 
e When В < 1, since 78 is integrable at zero, the points on ће f£ = 0 axis lie at 
a finite distance and those on the a = 0 axis at an infinite distance. Thus here 
we need to apply the sufficient condition (3.2) to guarantee the applicability of 
Varadhan's lemma. 
For the family of SV models 


d f, = a;d Wi; + pi pdt 
da; = Yad dWo, + Hadt 
< dWi,dWo, >= pdt 


where о and Y are constants. Then 

The (Gaussian) curvature of the Riemannian metric naturally associated to the 
problem is independent of the correlation and of the drift and is given by. 

The curvature is equal to 


(q — 2? 47D (3.3) 


Also, it is easily seen that when q < 1 the a = 0 axis lies at a finite distance from 
interior points (it suffices to note that vertical lines are geodesics) and also in this 
case the curvature of the metric blows up at a = 0. 


A final note concerns the relationship between the remarks above and the inter- 
esting recent work by Doust [19] who in determining call prices in the Sabr model 
for 0 « B < 1, takes into account the probability that the forward hits zero. Doust 
correctly points out, just as Lewis had done, in the case of the CEV model (see [42] 
p. 305), that the call price needs to be adjusted to allow for this possibility. There is non 
conflict between this result and the modified version's of Varadhan's lemma. These 
simply give sufficient conditions for the distance of points x, y from the boundary 
(here "infinity"), so that the effect of paths of the forward which reach the boundary 
is exponentially negligible in the small time limit. As Doust points out and as his 
numerics shows, for longer times these need to be taken into account. 

We also would like to remark that the when the local volatility component С( f, t) 
of the local-stochastic volatility models is such that С(0, 1) = 0 (for instance 
CCf. t) = Y( f P with 0, В < 1, the series constructed by means of the heat kernel 
vanishes on the boundary in the backward variables. Remarkably, this comes “for 
free” (see (8.1) in Sect. 8) and was not part of the heat kernel’s explicit construction. 
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4 The Projection Method via Gyóngy Projection 


A driftless diffusion (say in the forward measure) 
df; = oL( ft, t)dW,, 


gives rise to the so-called local volatility models, made famous by the work of Bruno 
Dupire. Given a two factor local-stochastic volatility model, of the form 


df; = b(t) fidt + C(ft, Nard Wit 

аа; = ша, t)dt F v(ar, t)dW2,1, 

fo= f, а=а 

< dWi t, dW; >= p(t)dt (4.1) 


the Gyöngy projection technique yields a one factor model 
df, = b(t) f (dt + oz Cfi, DW, (4.2) 


with effective local volatility o; (f+) and drift b defined by 


ej. Т) =0°(F, TE |a} fr =F, fo= fao=al — @3) 


Note that the effective drift of f, hasn't changed when the original drift is of the 
special form b(t) f (t). In fact, applying Gyóngy's result in our case, the effective 
drift is given by 


b(F, T) = E[b(t) f | fr = Е, fo = f, ao = a] (4.4) 


and the latter clearly equals b(T) Е. 

The local volatility model (4.2) has the same marginals with respect to the f vari- 
able as the original two factor model (4.1). Independently of Gyóngy, this projection 
(sometimes also called? marginalization) technique was discovered by Bruno Dupire 
who was the first to apply it in mathematical finance in [21]. 

As pointed out and exploited in Hagan Kumar Lesniewski and Woodward, in 
Hagan Lesniewski and Woodward [32] and in Henry-Labordére [39], formula (4.4) 
can be expressed using the joint probability density p( f, а, г, Е, A, T) as 


C?(F,T) fr A? p( f. a, Е, A, T)dA 


2 
oO (F, T) = 
í Jr, P(f.a, F, A, T)dA 


(4.5) 


?This expression was coined by Marco Avellaneda, who in 1999, without prior knowledge of 
Gyóngy or Dupire's work, independently discovered the technique. 
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Therefore plugging in the expansion (2.11) into (4.5) we obtain 


n 2 . 
CF, T) fa, X Ae aj (А, Тут'4А 
ОЕ, Т) = i= i (4.6) 
Jn, È 0А, TO 4A 
i=0 


where 
оо =: /g(F, A, T)uo (4.7) 
=: Ĉ (4.8) 
o; = /g(F,A,T)uot;(f,a, F, A, 1), ixl (4.9) 


1 А E ad 
Letting є = t and $ = + 


Qn(t))2 


and where we have canceled the common factor 


the above may be expressed in the form 


n 
CUS E. Aloje* dA 
i=0 


(4.10) 


of (F,T | f,a) = 


Now we apply the Laplace expansion to obtain an expansion of the local volatility. 
It is convenient to consider separately the time homogeneous and the more involved 
time inhomogeneous cases: 


4.1 Laplace Expansion: Time-Homogeneous Case 


Proposition 4.1 Suppose $ (Y) has a unique minimum in (0, +оо), at the point Ao. 
The asymptotic expansion of 


+00 Е 
А f(A)e xx dA, (4.11) 
0 


as є — О is given by: 


2лє _ 4o 
— o" (A9. 


ф” (Ao 
= [Oo + f (Ao)e + f? ae 
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where 


f (Ao) = f(Ao) 
f'A) _ @? Ao) (Ао) FADO (Ao) | 5(¢9) (Ao)? f(Ao) 


n 40) = зто) 809" (a9)? 2(9" (AQ)?  ' — 24(9 (A9)? 

(4.12) 
f2 (Ao) = (4.13) 
— — (1480 (9" (49)? f Cape (Ao) + 840 (9" (49))* f" (Age (ao? 
1152 (ф”) 

(4.14) 
— 8409" (Ao) f' (Ap) (Ao)? + 385 f (Ao) Ó (Ao)^ + 144 (ф” (Ag))* f (Ao) 
— 360 (" (Ao) f" (Ag (Ap) (Ø (Ao)? f/(A0) (49) (Ao) 
— 630 f (Ав)д”ф® (Ap)? (Ag) + 105 f (Ao) (ф” (Ao)? 6 (Ag? 
— 144 (9" (Ao)? f (A6 (Ag) + 168 f (Ao) (9" (Ao)? Ф) (Ag) (Ao) 
—24 f (Ao) (9" (Ag) 4° (Ag)) (4.15) 


where Ao is the point that minimizes 6° 


In using the Laplace expansion in conjunction with the heat kernel form (4.16), 
we must deal with a more general (than (4.16) expression of the form 


(A 


+оо 
[ (ъало) ља) ал (4.16) 


Let us introduce the notation sË ), i, j = 1,2toindicate the jth term in the Laplace 
expansion above applied to the function f;. We apply the proposition to numerator and 
denominator of the expression defining the local volatility. This yields an expression 


that has the following form 


OD eC + LOYD + ey? + FL?) + o() 
Qo Fe FE) же Ыы уыш) 


This ratio can be expanded in powers of e, and yields the following asymptotic 
expansion which can be applied to obtain the effective local volatility valid in a wide 


The expansion up to the first order (4.12) appears in several sources including in textbook form, 
as in Bender and Orszag [9], p. 273. On the other hand we have not been able to locate a source for 
the second order expansion given here as (4.15). 
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range of models. is: 


f 
LO 
(0) , (1) (0) (0) , „(1) (0) 
MOS (Л (£g tir Otio (fg +h Je 
1 
cross Uo +P = PI? +Л +) 
0 


4 C a e cO e f) IO C^ FIO FIP) +0), e 0 


Let us introduce the following notation in relation to the family of stochastic volatility 
models in (4.1), in conjunction with the notation introduced in our discussion of the 
heat kernel expansion, following Yoshida. Recall from (4.8) the definition of C 


C(a) = v/detg(a, Ў) АС, ао, f. a)P (fo, ao. f, a) 


where on the left hand side we suppress the dependence of С on variables other than 
а, since these are not relevant in the Laplace asymptotics. 


Proposition 4.2 Assume the distance function in family of local-stochastic volatility 
models has a unique minimum, as a function of the final value of the volatility a, 


dl с = Amin. Let ф = m Then the effective local volatility in the family of local 
stochastic volatility models (4.1), is given, up to the second order, by 


о? @, T, ao, fo, f) =: VL D = VO (f, OF V (f. т + VO (f, DT, 
(4.17) 


and correspondingly, by taking square roots and expanding, the local volatility is 
given by* 


€ (0) (1) (2,2 


Op =o; +ор €ctoy' € +--+ wheree = т (4.18) 
where 
o = vo (= C(f)o) (4.19) 
yi 
(1) 1, 
(4.20) 
L 520 


^In (4.17) we once again suppressed on the right hand side the dependence of variables other than 
t, T and f. In the next section, when we combine the asymptotics for local volatility with the above 
asymptotics for implied volatility, given local volatility, it will be important that the local volatility 
depends on the initial time f and on the final time T in the particular way indicated. 
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1 0 2 
Е 78 -(VDy + Av ye 


800/07)3/2 
ао 092669032 (042, 
“бз ша s "neo im (421) 
8(oy 3 
where 
Ve De ete (4.22) 
a. € (ё 260) ө”©) - сово) 
нче е - 
С(с)ф" (c) 
(4.23) 
(пит ,2) 
vpn s VE C UD 4.24 
L f ) у Эу, t) ( ) 
with 
ydg) = — 1 
i 24C (c? (9" (с))® 9" (с) 
and 


Ve GO 

= cQ (120 (6700)? (2€' + сео) OH") ("to ("о 

+ 2€(cui()$"(c)) = € (0990) + 6c)" to (29 tof (12c9" t? EW co) 
+35сС'(с)ф®(с)? + 6С” (с) фо) (3e (с) — 5сф®'(с)) 

- 15€ (9t) (2@®(с) + e$? ce) + (ф”)° (—126” сё”) 

(Ф” (©) + сои (Ф o? — сф®'(с)) + ĉo co" co 

(2467 (с)? — 2471 (d (tO? + Scho (c — 36" co) (—8Ф® (с) 
ed 9 (c))) + 2€ t) (7116699 (©)? + cui Od (Y (49 (с) + сф®'(с)) 
36" C) (2090  c$'9()))) + €? aco 

(o^ "tc (249 (с)? — 5с°ф®'(су?зсф” (с) (-86 to) 

оф) + (6")° (48e ogot + (Ф”(с) — cóc) 
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(569002 + 39^ (с) ®(с))) + 9" cof (—35сф® to? + 359" (999 vo) 
(69 © +сё®(с)) — 36"? (56 to + 2009 0))))) 


4.2 Laplace Expansion: Time-Inhomogeneous Case 


Since the heat kernel expansion for the probability transition density involves 
the backward (in financial terms, spot) time in all places except for the factor 
A/8(F, A, T) in Eq.4.9, the Laplace expansion technique is essentially unchanged. 
As was done in [28], in Sect. 2, we need to develop J/g(F, A, T) in a power series 
expansion around in time around Т = t. Ie. 


9 /gCF, A, T) 
J/8(F, A, T) = JRF, A, t) ES о) 
19./2(Е, A, T 
ADAD т)? + о(т?) 
=: do(A, t) + 4\(А,ї)є + d (A, D) -o(e?)), є=т (4.25) 


The presence of powers of e in the expansion above implies that letting 


=; A APui, 


Qa 


Pi = UC, AT) 


where о; was defined in (4.9), the expansion in powers of є will now follow from 
the expansion in powers of e of 


(B + poe + Be? + 0(2)) (со + cie + coe? + о(є?)) 


= BO co + (BP co + В cie + BO со + cB + 8‹®су) + о(є?) 
(0) ad) (2) 2 2 
Y +Ye+ Ye ote") 
І ГА 1 


(0) 
ї 


a 


Le. it suffices to replace in the Laplace expansion of numerator and denominator of 
the expansion of the local volatility function, a; by Y; throughout. Notice that the 
zero-th order y coincides with a in the time homogeneous case. Since only ao 
enters into the definition of a. this means that the form of the latter is basically 
unchanged. 

The new form of the ve is given in Appendix. 
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5 Coupling with the Local Volatility and Call Price 
Expansion 


5.1 The Key Quantities in One-Dimensional Case 


In [28] we obtained an optimal result for the asymptotics of the implied volatility in 
a local volatility model of the form? 


df, = oL( ft, t)aW, (5.1) 


In order to formulate our asymptotic result in the stochastic volatility setting, we will 
need to combine those results with the results in the previous section. We begin by 
recalling some of the required auxiliary quantities derived in [28]. 


e One dimensional (signed) distance function 


f 
ак) | NC T du, t€[0, T] 
к OL(u,t) 


e One dimensional heat kernel coefficients иу, о and и.о, given by 
И: 5 (di, (К, 0,1 
ul Dp, Клу= | eL Cf. т) exp | (di) CK, m a. (52) 
oL(K, t) K or (n, t) 


1а 
ad _ ub F, K, Е 


апа 


n 
(н? + Hy) +bH+c «f HG. К, zl 
K 


ак, ft) Jk | 2 
dn 

5.3 

* oL. D du 


where 


(от) (Р, D (di) (K, у, t) 
201(}, t) oL(f.t) 7 


д 
HC, К,0 = a Un wot, K,t))]- 


Expansion for call prices 


As noted by Henry-Labordére, following on the work by Dupire and Derman and 
Kani, it is possible to use a formula, which actually goes back even further, i.e., to 
the work of Carr and Jarrow [14] for the call prices C(s, К, t, T) which reads 


5Reference [28] explains how to adjust the results to allow for a non zero but constant interest (or 
other constant yield) rate. 
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d 1 A 2 
C(s, K,t,T)= (s — К) +5 oL(K, и) p(s,t, К,и)аи. 
t 
C(s, K,t,T) —(s — К)? 


k T 
1 2 1 
о1(К, и)е Fh sD Ping, — ШЕЛ uj (s, K,t). 
2A/ 2л 2. \/ | 


сә 


Letting 
T 
U;(w, т) = f wide иду, (5.4) 
0 


the expansion may, in the time inhomogeneous case, be expressed in the compact 
form 


Proposition 5.1 The expansion of the call prices in a driftless local volatility model, 


is given by: 
C(s, K,t, T) – (s — K)* (5.5) 
: > [оа + өрик, (d, 1) + a )u CK , OU, | (s, K,t) 
REL iMd, y i Р puto) 5 і uj (Ss, ). 
Jin 2. 1): +1 2 Lit 42 
Using that 
pue ape 2 
Uolo, Е P EL т. 
2т??? 2 
Ui(o, T) AS ЕЗ e d /2* 


an alternate form for the call price expansion up to order т?/? is 


Proposition 5.2 The expansion of the call prices in a driftless local volatility model, 
is given, in the small time limit т — 0, up to the order 1?/? by 


C(s, Kt, T) (s — K)* 


] qq ( 1 

= [2x 3/2 

= —— e —oOL(K,t)uo(s, K, n) T 
Jon | d? 


3 1 1 
+ (Сик, DuoG, K, 1) + (0L) 77406, K, t) + 55010, DuiG, К, je? 
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Implied volatility 
For the implied volatility the following expansion was obtained in the same paper: 
Proposition 5.3 The implied volatility ogs admits the following asymptotic expan- 


sion, away from the money: 


ops ^ овѕ,0 + овѕ1т + овѕ2т2 +0(т2), Tot, for f#K 


where 
Sf 
0в8,0 = 4 z = (4) (5.6) 
(K, fot) fe XD 
(d) 2 (d) 
3 uy CF, K QoL (KDE о (Куйи  CFK,Da(K, f.t) 
OBS,1 = Е2 m а (К, f, 1)3 И ` 
2 2 3 
" 30в5,10в5 0 30в5 1 Ово 
ne ¿2 20в5,0 &£ 
2 2 ld 
3ogso | OBso , (OL) (K, f) 3 и! EK.) 
£2 8 on(K,t) ФК, f. „ОӘК, t) 
1 
Зов$,1 3085 | ЕЗ E | (oL) (K, t) a RO 
d? ^ 2opso 82 d| оК) Орк, | 
(5.8) 


5.2 Plugging the Local Volatility into the Call Price 
and into the Implied volatility Expansion 


Given the expansion (8.4) for the local volatility in powers of є = т, we can use 
the 1D expansion above to determine the contributions of the higher order terms to 
the implied volatility. Note that the above expansion of the Black-Scholes implied 
volatility in powers of т has an underlying functional dependence on the underly- 
ing local volatility, or, ~ ur F eol + coe. obtained in Sect. 3. In order to 
emphasize this dependence below we write og 5,;[o,]. In order to combine the local 
volatility asymptotics (4.2) with the implied volatility asymptotics, we must plug 
the former into the latter and again expand in powers of e = т. Now an additional 
subtlety arises and taking it into account properly is crucial to the correct derivation 
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of all subsequent formulas. The subtlety is that the local volatility expansion obtained 
in the previous section introduces an additional dependence on the backward variable 
fo which (alongside ao) serves as an initial condition when using the call decom- 
position formula (4.10). Thus f enters in as a parameter and the "active" variable 
in oz ( fo, ao, f. 1) is f. Every time a differentiation or integration with respect to a 
spatial argument needs to be carried out, we need to use f as the active variable. 
This may seem a little surprising because in deriving the 2-D heat kernel we are 
freezing the forward variable f (or K), whereas now we are freezing the backward 
variable fo. This state of affairs is easily clarified when we remember that underlying 
Dupire's derivation of the local volatility function is a differentiation with respect to 
the forward variable. So, in a sense we are mixing a backward and a forward repre- 
sentation. Although this may seem odd, it does carry has some distinct advantages 
especially in the time-inhomogeneous case, since all quantities of interest there are 
evaluated at the spot time t, which is mostly frozen throughout. 

As an example, letting df (f, fo, t) denote the 1 — D distance function introduced 
in the beginning of the preceding section, corresponding to the expansion for the 
local volatility denote от obtained in (8.4) 


1 
z du 
от (fo, ao, Í, t, T) 


fo 
d (fo, Ў, Т) -f 


So, we obtain using that by definition = |T=t = зг 


c1 (fo, ao, f, 0 2 
(o (fo, ao, f. o 


d fo 
= FAL, fost, leno = ] A (5.9) 
€ f 


Therefore the expression appearing in uo can be written 


JP n of 1) (oap i.t) 
f 2. арт) “K ОА ная 


dud 


Applying this procedure throughout, we obtain the following results: 


Proposition 5.4 The small time to maturity expansion of the call prices in the family 
(2.1)-(2.3) is given by 


Ci K,t, Т)— (s — X)" 


1 Фр 1 3/2 3 (0 
— MIELE K, p) aa uer (K, t)uo(s, K, t) 


1 1 
+017 gto (в, K, D + -уор (K, DuiaG, К, n) dd 
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where gr is given by (4.19), a is given by (4.20) and where ио and и are 


given by (5.14) and иі 1 is given by (5.16). 


Proposition 5.5 Given the family of local-stochastic volatility models (2.1)-(2.3), 
and assuming the coefficients do not depend on time, we have the following asymptotic 
expansion for є — 0 


оўу loLlCfo. ao. K, t) = оўу [01] 
+ оф [oz] (fo. ao, K, t + ogg olorlC(fo, К,Т)т?,+о(т) t=T—-t>0 


where 
& 
ов olo1 I(fo. ao. К.) = ри (5.10) 
Ko fo,ao,u,t) 
оңа iloL](fo, ao. K, t) (5.11) 
& In o (fo,a0,K,t)uo,1 (fo, K t}d (fo,ao,K,t) 
Е EJ foK (5.12) 
i d (fo, К, t)? | | 
In the equation above 
dO (fo, K, t) = — (5.13) 
K o ® (fo. ag,u,t) 
uoi Cf, К, t) 
o (fo, ао, fo. t) f 1 1 oP (ford. Wt) o 
(0) АР (0) (0) нап. 
Or, (fo, ао, К, t) K Or, (о, n, t) K (от (fo, ao, и, t))) 
(5.14) 
Also 


SV 
Os 2 (Јо, ao, K, t) 


ш | 368.1 £3 3: E (oP и11 
(49)! ^ 2057, — 840) ^ AOS 


Br uo,1 
(5.15) 


where, recalling (5.3), and, for brevity suppressing the dependence on the initial 
variables ( fo, ao) we have 
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и11 = 


uoi [^ с (тї) 
JO 10 


2 1 dn 
x 0% + too) «f (o. K, na| aa (5.16) 


where 
of Dur) 
1 LU TW R 
е. Llc ums 
= 0 0 ? 
2(19)2 gm 
(Н у) о) 
@ 
1 o 0 0 
HESS 4 E (fO (010) кк 
© (буз (с)? UDO ag ^ 
(Fr) 2) 


1 f a6 Dy2 + oO 2) 
= wa 2 rà fep gm 0) аи er 
2(oy y K (oj » 
f er É (1) a), 0) (0), (1) 
2 / du jo,’ — o; (ор )k +o, (о! YK 
" mo L L VL L 


Remark 5.6 А simplified formula at the second “ш 
A remarkable E of the above expression for оўу. 2 is that in almost the entire 


expression only en ? and pr and their derivatives are involved. Recall that the 


determination of the first of tese required only the heat kernel coefficient uo in the 
two heat kernel expansion, while the second of these required only the heat kernel 
coefficient иу. The lengthy expression for a, ) ie. the coefficient of т? in (8.4), 
expressed in terms of the lengthy expression for V2, is required above, only for the 
first term under the integral of (H;)(2). Thus, we may propose as a very reasonable 
approximation for the implied volatility expansion at the second order to use (5.15), 
with all terms the same but using instead the modified 1/4’ defined by 


f 2(o Dy. 
A (°(/ in) oi 
200] )? к (0) 
/ o OD 
+ «(f E ic Jet" —о}?(о{?)к + о} (of dK ) 
K (от P 
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6 Example: Dynamic A-SABR Model 


6.1 Step 1: Reduction to Laplace Beltrami + Drift Form 


In this section we apply our expansions to the dynamic à Sabr model 


df, = C(f, Dard Wi, 
da, = k(t)( — aj) + ajv(t)dWo , 


Our diffusion matrix is 


( ii) - C?a? Y vpCa? 
ET Съра? va? 


with inverse 


1 p 
С2а2(1— 02 2›(1—о2\)\С 
e-( EAS vu) ) 
a?vC(1-p?) +2(1—р2)а? 


Recall that the Laplace Beltrami A в operator is given by 


Ав = 87129; (812609) = gaaj + g^! (g 2g))0; 


In the case of our metric we have 


g7! = —а*%С?ь?(—1 + p; 
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(6.1) 


We now use the definition of the Laplace-Beltrami operator to rewrite the original 


PDE in the form 


1 
и + ^ви T Viuf + Vous = 0 


where a straightforward calculation using the explicit form of the metric coefficients 


yields that the “metric” part of the drift is given by 


vig eh 9 o l1 č 2 
m AG vy f p (uc X ae eje C memo ) 


1 8-1 12 
= С дб). 2 y pa? fP! 
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1 1 1 
у2—- a?vy В 1 — p2a 12 а (—À—— B) 
( i j avy fÊ E d ир" 
1 2 B 1 2 
——4 ару 1 028 ( Y vpfPa ) 
( d a2vy f£ /1— o? 


1 2 2) 
+3 (= Р 
а a2v y f£ /1— p2 ) 


=0 


Thus the metric part of the contravariant drift can be written 


12 
Vn = 5 v Ba? f?P- 19, 
The corresponding covector field, is obtained by lowering the indices 


Vi" = апу V7 = 80У 


Непсе 
12 1 

ym —- 2 ¢2B-1 

1 ; Y Ba f y? fP (1— p?) 32 

1 B 

2 fA- р) 

p 12 3 2 35-1 
vrs p 
* — vY(1- p?) pe "T 
ur BY f^ 
2v(1 — p?) 


Thus the covector form of the metric part of the drift is 


B BYp f^ da (6.2) 


271-09 30-95! v 


Remark: Allowing general C( f, t). 


Via the same procedure, when instead of y(t) / we consider a general form 
C(f, t) in (2.1), we obtain, for the metric part of the drift, when C does not depend 


explicitly on time: 


| 8с pac 
af af 

fd 6.3 

302352834 33 63 


and, in the new variables defined by (6.6) and (6.7) below, this term may be expressed 


in the form 
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ac 
of dx, 


2/1 — p? 


a fact already pointed out by (Henry-Labordére [39] and Paulot in [48]). 

Note also that the first part of (6.3) is a perfect differential 5 d 4 25 d log C(f) and 
can thus be integrated directly along the original geodesic (from final point (K, a) 
to initial point ( fo, ao) to get 


1 C(fo, 1) 
log 
2(1 — p?) C(K, г) 
The work done by this term is thus immediately evaluated to be 
C(fo, t) n5 
CED. t) 


In particular in the A-Sabr model, we get 


B 
Fo 2(1— p? 
(sy T 


To this metric part, we need to add the drift coming from the mean reverting 
volatility. This vector has only the 0, component 


ymeanrv _ KÀ — a)ða 

Vmeanrev = 812к(А — а)аў + gx (A — a)da 
pk|t](-a + Alt) к(-а + М) 
2v (—1 + p?) CIS, t] а212 (—1 + p?) 


(6.5) 
Define 
fe 
q = 
y@)C — B) 
We make the change of variables 
vq — pa 
X = o———— (6.6) 
vy 1 -— p? 
a 
y=“ (6.7) 
v 


df = Y —p2Cdx +Y pCdy (6.8) 
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This change of variables is defined in such a way that the principal part of the 
partial differential equation, expressed in the new coordinates, is the standard Sabr 
model corresponding to the (rescaled) hyperbolic plane, i.e., of the form 


1 
2 vA? (Uxx + Uyy) 


After writing the backward partial differential equation in the form I^ B + V, each 
of the terms is covariant. This means that in order to determine the contravariant 
drift in the new coordinates, it suffices to transform the contravariant drift in the old 
coordinates by the recipe for the change of a contravariant vector under changes of 
the independent variables. However, since the changes of variables we are making 
(6.6) and (6.7) depend explicitly on time it is clear by the expression of the new 
dependent variable in terms of the old, i.e. u(f,a,t) = v(x(f,a, t), y(f,a, 1), t) 
that the in the drift term for the new PDE there is an extra term coming from the time 
derivatives of the independent variables. A simple calculation shows that the metric 
part of the drift (6.2) can be expressed in the compact form 


8—1 
EIUS Р (6.9) 


2/1— p? 


Recalling the transformation (6.6) and (6.7), the time derivatives are 


( 1 y f p' ( 1 y p 
SEE y vy 
' Y ü-Bi-gog ü-e»?" v A-o 
y y р! v p 
= х + + 6.10 
Yi p T po С) Tao! (6.10) 
1 
y = (70у (6.11) 
Note that from the transformations (6.6) and (6.7) we obtain the relation 
d d 
dx = f пс (6.12) 
У р? y fe 1—2? 


For the non-metric part of the drift arising from the mean reversion we obtain 
from (6.12) and from (6.5): 


pk. 00) a. (ee vy) , (А = vy) 


= M d 6.13 
V1— p2v37 d= "(qe os) E Bon 
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All in all the PDE satisfied by v is therefore 
i IyhyH uu, + Uy) ЙЫ + Pry = 0 (6.14) 
t 2 Uy (Uxx yy Ux Vy = V, . 


where 


> B= UE 1 
бд [Р окос v. y jx }ах 
2W1—p2 ү1- р203у2 У 


Since the first part contains an exact part, it is better to express the first expression 
in the equation above in the form 


BYp 


Е В—1 
20-954 “4 


1 
————4@1ор f? 
2/1— p? 

As mentioned earlier, the first part is trivially integrated (see (6.4) and the second 
part of V,,dy so we have the full drift written as 


V.dx + Vydy + -dlog f? (6.15) 
p 


1 
2/1— 


z = 1 
Ү,ах = ок( vy) +-—у% ах 
JV1l—p2v3y2 y 


Е BYp B-1 1 kÀ = vy) 
V,dy —|———— а ау, 
id ( epg Ut pet шй d 


_ BYp 1 M а каты) d 
20-9)ü-pgyyVl-pixcpü—-B)yy P” vy 


where, in the next to last step, we used the relation 


1 
|» (1—-80Ү1—о?х+ю(1—В)Уу 


Га: 


Recall that the difference between V and V is that in V we removed the exact part 
of the drift vector. For completeness we note that, alternatively, including the exact 
part explicitly, we have the relations: 
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> p A- 1 
"c BYf petes vo). xc x ds 
2/1—p9? ү1- р203у2 y 


Е d еб ы To 
Ол о ВОЛ оох + ру) У о2о wt 


(6.18) 
Vy = (2и + a. (6.19) 
and the associated contravariant components: 
du By убок ру) о, 
2/1 = pha - BVT = p?x + ру) y1- phy? 
VY = vey + А0 (6.20) 


6.2 The Geodesics in the Poincaré Plane 


The changes of variables we have made in the last section, transform the original 
operator into the operator 


9 + +50 re + 02) + V*9, + Уд, 


The factor v? in front of the second order part means we are not yet in the standard 
Poincaré plane where v = 1. To adjust for this we further make the change of time 
ГА 
і = —5 (6.21) 
v 
so, in the /" variable the operator is now in standard form 


m 2 ve Vy 
д, + 57 (9; + 87) + 29k + — 9 

As is easily seen the covariant form of the drift remains the same (due to offsetting 
effects), but the contravariant form changes by a factor of v?. 

The geodesic passing through the points (x1, y1) and (хо, y») is known to be the 
semi-circle with origin at (xo, 0) where 


хў —хү +0 —%/ 
2(x2 — x1) 


хо(х1, yi, X2, Y2) = 
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and where the radius R is given by: 


R= үй + (x1 — хо)? 


= J + (x2 = xo (x1, Y1, х2, y2))? (6.22) 


It will be convenient to use standard polar coordinates 


x = xo + Ксоѕ0, Ox0 xz 
у= Rsinð, 0<0 <x (6.23) 


Notice that if we denote by (x, y) the running point on the geodesic, then the polar 
angle corresponding to this point, issuing from a fixed point (x2, y2) is given by 


0(x, y) 
y y 
arctan( ETE ) arctan( OE )>0 
uu х= 2(x5—x) x= 2(x2—x) (6.24) 
arctan ( ua ) + л, arctan( CNN )«0 
Ne = = a 


6.3 Calculation of the Work Done by the Drift 


The integral we need to calculate is 


ах + V,dy 


V(r, yi), (42,42) 


where Y((x1, y1), (хо, y2) is the geodesic joining the points (x1, y1) and (x2, y2) in 
the standard hyperbolic plane, which is the image of the original f, a planed under 
the coordinate transformation (6.6) and (6.7), call it A. 

Let us first carry this out for the so-called “metric part" of the drift, i.e. the part 
corresponding to x — 0 and no time dependence in the coefficients. 

Then, using (6.18), for the metric part of the drift we have that 


| V"dy 
Ү(х,у) 
05 


BYp cos 0 do 
2 — 9? 4 V1 — p? (xo + Rcos0) + Rp sind 
1 
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Therefore we must calculate the integral 
1 


cos 0d0 
М1 — p? (xo + R cos0) + pRsind 


To evaluate this integral, note that we may write the identity 


cos 0 
М1 — p? (xo + R cos0) + pR sin 
/1— p? 


= 2 + 7d log (vi — p? (хо + Rcos6) + pR sing) 


xo (1 — 02) 1 
R М1 — o? (xo + R cos0) + pR sin 
and therefore the integral can be written 


1— p? 


R 


(d; — 8) + 2 log (vi = p! + R cos 8) + pR sin) [2 


1 


ө 
ке 
R 2 V1 — о2(хо + R соѕ0) + pR sin 
1 


do (6.25) 


and it remains only to evaluate this last integral. 
This may be expressed as 


05 
1 
dé 
à, V1 —p2x0 + R cos (0 — arctan( m э) 
=p 
Letting 
ај = ү 1— p? xo 
ў = arctan( E); 
/1— р? 
the last integral can be expressed as: 
05 
1 А 
/ —d(0 — Y) = (6.26) 
aı + Rcos(0 — Y) 


1 
———————— 
h 
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05—Y 1 
= — do 6.27 
|, а + К cos0 ( ) 


2 tan $) mne if a? > R? 
Мак Y 


0—01— 
2 | а] 0—6-9 р 2 2 
jeg est prm tan 5 £) loo 9 if ap < R 


0—80. " 
Ue tan] е Т if a=R 


a,—R 
arctan ( GER 


(1 cot(2)) ja ae с if of =—R (6.28) 


So that all told, taking into account (6.4) we obtain the following expression for 
the line integral corresponding to the metric part: 


log(P) mE 
Y /1— p? 
наги, = p (05 — 61) 4 Ê log (A p? (xo + R cos 8) 4 pR sin) ES (6.29) 


xo (1 — р? fo 


) \ Æ 
R 1 КоЛ x 


) (6.30) 


ASabr part 


Next we deal with the specific to A-Sabr part, which have found above to be 


А – А – 
юк@.— аа О ИА 
Л — p2v3y2 y3y? 


Inserting this into the line integral that must be evaluated, we find 


(6.31) 


- 05 
Xr 
= 2 csc 0d0 
] — o2y3 
0 


140 


pk / 
1— 922? 
У IS S 
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n 0d0 = log(tan(0/2)) + C 


[= - 1 +c 
51120 sin8 


n = log(| sin 0|) + C 


Therefore, all told we obtain for the line integral corresponding to the mean reverting 
part: 


pKh tan(05/2) ок 
log | | (05 — 01) (6.32) 
RA 1-— p2v3 : tan(01/2) — /1— p2v2 
kà, 1 1 к [віп 
1 6.33 
VAR nd sin» v? ogC sin Ө | (o9) 


Since 0 varies between 0 and л and since in this range both sin Ө and tan 0/2 are 
non-negative, we can remove the absolute value signs above and arrive at 


log(P) ur 
pKa tan (> /2) pK 
= (02 — 01) 
R1 — p2v3 tan(04/2) М1 = p2v2 
P кА 1 1 ) K | ш; 
o 
v?R'sinÓj sin&2 v? g sin 0j 


(6.34) 


Time dependent component of drift 


The last integral we need to evaluate arises from the term x; v, + y; v,, which, as we 
have seen, gives rise, after lowering the indices in the hyperbolic metric to: 


Xt Ut 
—dx + J “dy 
/ y? y? 


Y[x,y] Y[x.y] 
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Using the explicit expressions (6.10) and (6.11) we find that 


Xt 
y 

Y x 1 p! 1 v p 1 
= ——(— +p ) +( 

yg" Ji-gy (СЧ-—9Э3?у v J1—p2y 
Ut m vl 
y vy 


So the integrals to be calculated are: 


log(P;) (6.35) 


|| 
= 
|= 
d 
N 
Me 
| x 
ке) 
ыш 
LIS 
d 
N 
gives 
к=] 
М.У 
$— 
Ф 
a 
SS 
el 
oo 
Ф 
a 
= 1% 


which gives 


log(P;) 


1— o? Y /1— p? 


Y {1 tan 05 sin 05 
log log |— | 
tan 01 sin 01 


(6.36) 


7 Calculation of иј Heat Kernel Coefficient 


The calculations of the heat kernel coefficient и (and all other heat kernel coeffi- 
cients) can be carried out either in the original variables ( f, a) or in the (rescaled) 
standard hyperbolic place with metric ds? — a (dx? + 02), provided we use the 
transformed drift vector (6.20). | 

Recall, from formula (2.12) with i — 1 that 


1 а 
и =w05 | ug! (Luo + ux) ds (7.1) 
0 


Recall that a time change (6.21) gets rid of the extra factor v?. 
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where 


d(fo,ao, f,a,t) $ 7 
ир = УАР ew f аео), а), sa „) (722) 
0 


——————————— 
uot 


where we recall that using (6.4), (6.34) and (6.36), P is known in closed form and 
where in the a formula above the distance function is given by (7.8). Also recall that 
A is the Van- Vleck De Witt determinant, and we know that in the hyperbolic space 
of curvature —1 


d 

dE sinh d’ v) 
As noted by Willmore [54] p. 208, the Van-Vleck De Witt determinant (which Will- 
more calls the “discriminant function") is independent of the coordinate system cho- 
sen. This means that to express the VVDW determinant in the original coordinate 
system, it is sufficient to use the same expression (7.3) in conjunction with the dis- 
tance function in that coordinate system, i.e. (7.8). Since all ingredients in ug are 
explicit we can now simply insert this expression into (7.1) and use a symbolic 
calculation engine like Mathematica or Maple to do the calculations. 

Alternatively, we can do the calculations using the formula for иј in the (x, y) 
plane. This has the advantage that many of the terms arising during the calculation 
can be computed, once again, in closed form. We discuss this further in the remainder 
of this section. 

The fact that A depends exclusively on the distance simplifies some of the expres- 
sions involved in и |. In polar coordinates based at the point y, we have that the Laplace 
Beltrami operator can be expressed in the form 


вво 14) 

a Nar PX A J)ar ар { 
ə? ФА n—]Y 8 ә? 

m дг? +( А ш r or „апа, D agi Оз) 


see Proposition G.V.3, p. 134 of Berger-Gauduchon-Mazet [12] and use the fact that 
(A)T! = O(ru) as defined in С.Ш.3 in [12]. For brevity's sake, let 


А 
or m1 
«кре (= Ы =!) 


Using the explicit form of А on calculates that, in the case of standard, hyperbolic 
plane, 
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c(r, t) = Cotanh(d) 


1 
„р атту; 


so that the above operator сап be expressed as 


2 a 92 
58 + c(r, ar talt, 0, Ds 


Letting Ag act on uo, there are some simplifications due to the fact that A (Van 
Vleck De Witt determinant), depends only on r 


Авио = Agly p^ Pir, @)) 


9? д? 
- (2 — tcc, DË = +alr, 6, Napa) аа) 


= (PC, ДЕ t cr, os.) (VAC) + С 6.03 Ju o») (у A(r)) 


Therefore 
Lug = 
= PLA 5 + A ALP 
l1. 
=PLVA+VAPLA+ 58 Ai AjPVA 
And so 
ug Luo 
LVA 
= —— 4+ LA+ <g4 AA; 
JA - Ј 
LVA 
= +LA+ A+A 
PS у zi a) 


So, that putting it all together and inserting into (7.1) we obtain an expression of 
the form 


1 LVA 
varz | ELAS. 2d 2 (AZ + A?) } ds 
d X2,92,X1,y1] МА d 


Since the first part e depends only of the distance, it can be integrated in 


closed form, and we obtain, using Mathematica, and the expression (7.7) below, that 
it equals 
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Е (—1 + d? + d coth d)4/d csch d 


4q? 
Now 
LA 
y - А 
= 3 (Axx + Ayy) + УХА. + У” ДА, (7.6) 
LVA 
JA 
(—1 — 3d? + (1 + d?) cosh 2d) csch*d/sinh d 
= (7.7) 
8a? (csch d)3/2 


So a key quantity that we must calculate is the action of the differential operator L 
on the exponent A in P. 

Expressing this exponent using the standard polar coordinates (6.23). We have, 
with fixed point (x2, y2), letting for brevity 6; = 0 (x1, y1), 02 = 0(xo, y») with 0 
given by (6.24) 


01x.9.X2.92) — _ 
A(x, у, x2, Y2) = RG, y, x2, w | (—V, sin Ө + V, cos0)d0 
0» 
Therefore 
aA TD z 
FP у) = [r (-% sin Ө] + Уу(х1, yi) cosi) ЭЧ m—— 


0€Qd.ya.x2.92) — | _ 
+} OX n (— V. sin Ө + V, cos 0)d0 
0 


: X=X1,Y=Y1 


92.4 Ө (х1,02,х2,02) En " 
эт? ©? yi) = Ra f (—V, sin 0 + V, cos 0)d6 


2 х=х1,у—1/1 


+2 [ao (-*. sin 01 + Vy (x1, y) 60s 61) (ЛЧ |х=х1,у=м 


+ [R (V, sin + Ӯ, л, л) costi) ө ben ven 


and similarly for other partial order derivatives 2A, Note that the evaluation of the 


terms above requires knowledge of the value of the integral which was computed in 
the previous section. 


7Note this angle is not the same angle as that used above in geodesic polar coordinates. 
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Also we may calculate the second order partial derivatives Axx, Ayy» Axy and 


express these in terms of V and derivatives of the polar angle 01, given as in (6.23), 
(6.24) and of the radius, given by (6.22). 


7.1 Distance Function and Derivatives in Time Dependent 
Case 


The distance function in the original coordinates is given by 


2 Е _ 2 
До ауу Leow (1 ud 2e(t)(y1 — yoda + Q1 — yo) ) =й 


2(1 — o2)yoyi 
(7.8) 


oOo ( v?qrq? — 2p()v(r)(ai — ag)q + v (t) (a — ar 
— — cosh 1+ 
v 2(1 — p2)ayaz 


where 


1 
Y) f 


pdf = 5 P- A) 


fo 
ME OE [A = 


Next we check the value that minimizes the geodesic distance from the hyperplane 


f=fi: 


This value is given by 


c(t) :— (ai)min (t) = ao ad = 2v(t)q(t)aop(t) + v?(r)q?(r) (7.9) 


After some simplification the value of the minimum distance at the minimum point 
this may be written in one of the two forms, one corresponding to a distance function 
and the second to a signed distance function (of the point (ао, fo) to the line f = К) 


i. а aop? — a + 4202 + 2agqvp 


min T » ао (1 _ p?) i 


unsigned 


(7.10) 


and 


1 1 
d$. = 1 + 2s 2 + 2712. | В а 
тіп v og (za =) [v aop уа? aovpq т v^q signe 
(7.11) 
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As mentioned above, this second version of the distance function can become 
negative, but it's absolute value coincides with the expression above. This distinction 
seems to have not been emphasized in previous treatments of the subject. In calcu- 
lating the local volatility expansion, especially the term V», we require derivatives 


up to the sixth order of ће ф = 2 These are listed below: 
Let 
p? (7.12) 
_ Csch(vd)d (7.13) 
vJ ag + 4? — 2qvp (ao — aop?) 
S 
_ 3Csch(v d)d 
aov (aj + 4? — 2qvp) (-1- p?) 
(7.14) 
ф® 


(sese ( Adagv (—14 р?) | Ja + q2v? — 2qaovp(1 - dvCothiav))Csen(d»)) ) 


(av? (a8 + qv? — 24aovo) ^ (-1 + 02)?) 


(7.15) 

p” 

30Csch(dv 2dv 4p Co End vCoth(dv))Csch(d v) 
Е Си (у ao (а8+4222—24аоър) (=1+p2) 
aov? (—1 + p?) 
(7.16) 
and 
FXO) 


1 
(av? (a? + q?v2 — 2qavp) ^ (-1 + p) 


2 
x (iscsenaa С 14 p) 3 (а I q^? 2gavp) Coth[dv]Csch[dv]? 


+dv (ч + q^ — 2qavp) (2+ Cosh[2dv])Csch[dv]^ 


+ 18a a? + q?v? 2gavo ( 14 p) Cschldv (dvCosh{dv] — ван) (7.17) 
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8 Probability Distribution, Local and Implied Volatility 
in Dynamic A-Sabr Model 


Collecting the results in this paper, we see that the probability density function in 
the A-Sabr model is given, at the zero-th order, by 


P(f, 4, t, K,A, T) 


Е 1 1 КЕЙ! yg d 
2x(T — 0) /1— о(Т)?С(К,Т)ь(Т)А? `/С(К,) d 
б oka (- pane Bat (yaQ). Һа) dp) +log(P)urtlog(P;) 


where the distance d in the above formula is given by (7.8). In the case of the A-Sabr 
model, the results in this paper allow us to sharpen the above results and obtain the 
expansion for the heat kernel up to the zero-th order in the form 


р (f. a, t, К, A, T) 


A 
1 І ү? m 
© 2n(T — t Л p(TGv(T) (T)KP A2 \ KS sinh d 


2 d ay, f.a, F, 2 
= aft (= fo oto 00 8400040 Ла) dp) osa g-Hog(P;) 


xe (8.1) 
where log(P) мк is explicit and defined in (6.34) and also the fully explicit log(P;) is 
given by (6.36). Also the derivative ofthe distance function appearing in the exponent 
of the exponential can easily calculated for any specific functional form of the time 


dependent parameters Y(t), v(t), p(t) by differentiating (7.8) with respect to f. 

Using (8.4) we see that the zero-th order local volatility oj, for the family of 
stochastic volatility models (2.1), (2.2) (for any local volatility C( f, t)), can be 
expressed, in the form 


0 
of” (fo, ao, К, т) 


fo 1 
= CCK, (amin = C(K, D ad — 24a1v(Dp(0q + 42220), (su. K)- L zen) 


To determine the implied volatility we need to calculate the integral 


[ 1 P 
u 
к C(u, t) a] + 24( fo, waivpg (for u) + 2 fo, u)v? 
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Note that this integration is easy to carry out because — с 5 = dq. In particular, in 


the 4-Sabr model case, with C (f, t) = Y(t) f? we get: 


E 1 
ов8 ЫТ ve] — руи 000 + yag ОЛЛО (8.2) 
d 3 
== gO lamin = d (8.3) 


min 


where & was defined via (5.6), 2», is the signed distance to line f = K given in 
(7.11) and q was defined by (7.9), and we recall that ao is the initial value of a;. 
This zero-th order result, in the time homogeneous case, where the coefficients v, o 
are constants and where Y — 1 agrees with that given in Berestycki et al., Henry- 
Labordére in his Encyclopedia article [35] but does not agree with the formula in 


Hagan et al. [31], a discrepancy already pointed out by Obloj in [46] 


of = е? + oe + oP e? Le wheree = т (8.4) 
(1) (1) (0) ү,(2) 

_ боо, VE Vi )°+4У; VE a 

= (09 + т a (8.5) 
Гу? svn 
where, recalling that c = (a1) min, given by (7.9), we have for a g 
" va? (Ско) +x) e^to - eC) 

o; (f.t) = ‚ (8.6) 


2cC (c)p"(c)? 
where recalling the definition of € from (4.8) we have 
С (с) 


B 1 dmi n 


© a2v(t) v(t) ff / (0 — p20) V sinh(dmin) 


B 1—p2 
x f? elog Pu к)+1ов(Р,) 
B 
K2 


where the quantities in the exponent of the exponential were defined respectively 
in, (6.34), and (6.36), and where the derivatives of the functions ф were supplied in 
(7.13) and (7.14). 

The Black-Scholes implied volatility is now given by (5.12) whose expression we 
recall here 
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Eln a (fo,a0,K ,t)uo,1 (fo, K,t)dmin(fo.ao,K,t) 
(1) EJS К 
Ops = dB 


and where we recall that the definition of uo 1, given by (5.14) (where we need to plug 

in the quantity, and only that term, involves M and hence involves the non-metric 

of the zero-th order heat kernel. Also, recall that dmin was given above (8.3). 
Moreover for the second order expansion of local volatility and implied volatility 


respectively we have 


1 0 0 2 
—A(o1 (ay? + A(o yr yr? 


of ) = 
0 3 


where y. given by (4.24) in the time-homogeneous case and by (A.1) in the 
time inhomogeneous case. The ingredients going into the definition of ye , include 
the derivatives up to order three of C at the minimum point, and derivatives of ф 


up to order six, which were given in (7.13)-(7.17). Lastly we obtain о by using 


expression (5.15). 


0.074 - 


new order 1 
MC "exact" 
Hagan et al 


0.072 


0.07 - 


0.068 - 


0.066 + 


0.064 - 


0.062 П П L П L i П J 
100 100.5 101 101.5 102 102.5 103 103.5 104 


Strike T MC Option Implied Vol MC со Order1 Hag Order 1 new 


100 1 4.004 0.0726 0.0701 0.0735 0.0728 
101 1 3.356 0.0699 0.0671 0.704 0.0701 
101.5 1 3.039 0.0686 0.0657 0.0689 0.0688 
102.5 2.459 0.0666 0.0596 0.0661 0.0666 
103 1 2.212 0.0656 0.0619 0.0649 0.0656 
104 1 1.737 0.0637 0.0599 0.0628 0.0639 


Fig. 1 1 Year; model parameters So 102, ао = 0.65, v = L T 1, В = 0.5, р 0.5. The 
third column contains the option price calculated using Monte Carlo and the fourth column contains 
the (benchmark) numerically inverted implied volatility 
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0.078 T T T T 
new 5101 
0.076 F 5101 Hag /- 
МС 5101 
0.074 
0.072 
0.07 
0.068 
0.066 
0.064 
0.062 
0.06 
0.058 L 1 1 П 1 І 
99 100 101 102 103 104 105 106 
Strike Т МС Оріноп Implied Vol MC со Order1Hag Order1new 
99 0.5 3.939 0.0744 0.0678 0.0751 0.0745 
100 0.5 5.340 0.071 0.07013 0.0718 0.071 
101 0.5 2.499 0.0685 0.0671 0.687 0.0686 
101.5 0.5 2.181 0.0669 0.0657 0.0673 0.0673 
102.5 0.5 1.625 0.0648 0.0596 0.0646 0.0648 
103 0.5 1.39 0.0639 0.059 0.0634 0.0638 
104 0.5 0.973 0.0619 0.0580 0.0614 0.0619 
105 05 0.657 0.0605 0.0577 0.0599 0.0606 
106 0.5 0.427 0.0598 0.579 0.0589 0.0601 


Fig.2 Half a year (Т = 0.5); model parameters So = 102, ао = 0.65, v = 1, T = 1, В = 0.5, 
p = —0.5 


9 Numerics 


In this section we compare the accuracy of our first order expansion with that 
of Hagan-Kumar-Lesniewski-Woodward [31], who do not provide second order 
approximations, on three time horizons. In all of the numerics below, the bench- 
mark prices used were obtained from a Monte Carlo simulation with 40 time steps 
and 300,000 paths per time step. The standard error associated, calculated by means 
of dividing the empirical standard deviation by the square root of the number of 
paths, was 0.0032. The dimensionless parameters involved in the empirical work 
were large, since we took vol of vol v to be equal to 1. 

What we have found is that our first order approximation is more accurate on 
time horizons of 0.5 and 1 year, both in and out of the money. On a 2 year time 
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0.078 T T T T 
new order 1 
0.076 - order 1 Hag] - 
order 1 MC 
0.074 - + 
0.072 - J 
0.07 - 4 
0.068 - - 
0.066 - J 
0.064 - 
0.062 1 1 | 1 1 
100 101 102 103 104 105 106 
Strike Т MCOption Implied Vol МС со Order1 Hag Order 1 new 
100 2 5.3406 0.0749 0.7013 0.0769 0.0756 
101 2 4.962 0.0726 0.067122 0.736 0.0731 
1015 2 4.3558 0.0726 0.0657 0.072 0.072 
102.5 2 3.735 0.0726 0.0596 0.0692 0.701 
103 2 2.94 0.0726 0.059 0.0657 0.0692 
104 2 2.906 0.0066 0.0580 0.0657 0.0679 
105 2 2.552 0.0663 0.0577 0.0641 0.0673 
106 2 2.092 0.0639 0.579 0.0631 0.0677 
Fig. З 2 year; model parameters So = 102, ао = 0.65, v = 1, T = 1, В = 0.5, р 0.5 


horizon, Hagan et al.’s first order approximation is sometimes better than ours out 
of the money and tends to be worse in the money. These results are illustrated in the 
figures (Figs. 1, 2 and 3). 
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Appendix: Form of ve in the Time-Inhomogeneous Case 


yo 
L = 
1 


24€t (с)?адо(с)?ф” (с)5 


CC. D? (12e9" (с)? (—2а(е)©т (с) + с©т(и)(д\(и) + uw)" (c) 


(Ф” с) (267 tod) — dote) Ct" (с) — 2C dio + ити)" е) + doc) Cr coe? ce) 
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Сі(с)ф" (c) (12c9" (c? CAIG) (2dh(c) + c(di(c) + и1(с))ф” (с)) + 

+ TEO (h U) + wi 0) (9^ (с) (44) — cdg (©) + 2e) + u1 (09) + hOp) 
- dy (cy (c) (48ed с) CT (9^ (с) — 24С1(и)а (и)ф” (с)? — ACH ui Q0 (с)? 

— 12c7di (c)Ct" ()9" (c) — 12u (e) Ct" ()9" (с)? + 24cCt di 009" ()9 9 (c) 

+ 24сС1(и)ил (0) 9" (с)ф® (c) + 5c? Ct odi a9 (с)? + 5e? Cru)yui (и)ф® (с)? + 12Ст (с) 
x (aao (4^ to) — сф®(с)) + eo" co (4450 + (i + wi ©) ("о + се (0)))) 
3c! Cra (d, (и) + ити) фф O) — 244g? (co C? Cr co) + 2067 (с)ф® (с)? 

+ Cr cy (©) (670) — 26 (0) — cr c" © (269) + св v) ) 
+ Cro (12e9" (с)? (248) + cdi (с) + и (0) €) (at o9" ©) — 609 c) 
+ doy (с) (—24di (Cd (€? — 24ui()9" (0)? — 48са|(с)ф” (C — 48си (с)ф” (с)? 
— 24có/ (c! dj. (c) + 48d, (c6 (c)6 9 (c) + Aed (c) (с)?ф® (c) 
+ 24си (с)ф" (с)2ф 09) (c) — 48d; (9 9 (c) + 5с?а (с)ф” (с)ф®\ (с)? + 5u (9 (с) (с)? 
— 244; (о) фо) (фо) — 269 (с) — 3ed (c) (вас) + edite) + wi O9 ()) $9 ©) 


+ 2do(e)? (1509 (0)? — 9" t)99 vo) (156 (©) + 16c9'9 (c) 
39" C (2д®(с) + e6(c))))) (A.1) 


where all quantities appearing above where already defined in connection with (4.24) 
in the time-inhomogeneous case and where 


Ct = VAP 


and dj, d2 where defined in (4.25). In the case of the A-Sabr model all required 
derivatives of $ (up to order 6) were supplied in Sect. 7.1. 
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General Asymptotics of Wiener Functionals 
and Application to Implied Volatilities 


Yasufumi Osajima 


Abstract In the present paper, we give an asymptotic expansion of probability 
density for acomponent of general diffusion models. Our approach is based on infinite 
dimensional analysis on the Malliavin calculus and Kusuoka-Stroock's asymptotic 
expansion theory for general Wiener functionals (Kusuoka and Stroock, J. Funct. 
Anal. 99:1—74, 1991 [12]). The initial term of the expansion is given by the geodesic 
distance and we calculate it by solving Hamilton's equation. We apply our approach 
to obtain asymptotic expansion formulae for implied volatilities in general diffusion 
models, e.g. CEV and SABR model. 


Keywords Wiener functional - Stochastic volatility + Hamilton equation * Malliavin 
calculus + Asymptotic approximation + SABR model 


1 Introduction 


There are many applications of asymptotic expansion theory to mathematical finance. 
The most popular is the singular perturbation approach. For example, Hagan and 
Woodward [6] gave an asymptotic expansion formula for implied volatilities of local 
volatility models and Hagan et al. [7] gave a formula for a stochastic volatility 
model (SABR model) based on Hagan et al. [8]. Their formula is well-known to 
practitioners. Berestycki-Busca-Florent [1] applied non-linear PDE analysis to this 
problem. Henry-Labordére [9] applied a heat kernel expansion method and gave ап 
asymptotic expansion formula for a mean-reverting SABR model. 

In this paper, we take an approach based on Malliavin calculus. The theory of 
asymptotic expansions of probability densities based on Malliavin calculus was orig- 
inated by Bismut [2] and was developed by Watanabe [17] and Kusuoka and Stroock 
[11, 12]. Many applications of this theory to finance were given by Yoshida [18], 
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Takahashi-Kunitomo [10] and Siopacha and Teichmann [16]. In [14], we gave an 
asymptotic expansion for implied volatilities of SABR model with time-dependent 
coefficients. Deuschel et al. [3, 4] gave density expansions for multi dimensional 
hypoelliptic diffusions (X, ..., X4) at fixed time T and projected to their first / 
coordinates. They applied their results to short time and tail asymptotics of implied 
volatilities for some stochastic volatility models. 

In this paper, we apply the methods of Kusuoka-Stroock [12] to the asymptotic 
expansion for implied volatilities of call options. The key theorem is given in [13] 
and also summarized in the Appendix. Finally we give explicit analytic formulae in 
general diffusion models. 

Let (2, F, P) be a probability space and let (W! (t), ..., W(t); t є [0, T]) be 
a d-dimensional Brownian motion. Let Vo, ..., Va € Ср ([0, T] x В“; R^). Here 
Ср ([0, T] x К^; RY) denotes the space of RY -valued smooth functions defined in 
[0, Т] x R whose derivatives of any order are bounded. 

Now let Х, (1), t € [0, Т], € є (0, 1], be the solution to the following stochastic 
differential equation: 


d 
ахі) = У eVi (t, X ()dW @) + Vit, Хе), 1i < М, 
К=1 
X,(0) = xo = (xå, ... x), xo € В“. (1) 


In view of financial applications, cf. below, we assume 


Vi = 0, Q) 
and the ellipticity of Vi, ..., V4 at xo, i.e. there exists a constant 6 > 0 such that 
d 
V0, хо) & 700, xo) = 81, (3) 
k=1 


where J denotes the identity matrix. Then there exists a unique solution to (1). 
Moreover, we assume that X; (7) is continuous in 7 with probability one. 

We investigate the distribution of X 1 (T). From the ellipticity condition (3), the 
law of X I(T), denoted by vs, is absolutely continuous and has a smooth density 
De(y). Let Н be the Cameron-Martin space of d-dimensional Wiener space. We 
consider the associated ordinary differential equation: 


а. M | | 
a (t; h) = >, Vi (t, y(t; hy)h* (n) + Vo(t, y(t; ћ)), гє [0, Т], heH, 
k=1 


y(0;h) = xo, xo ER". (4) 
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We define the energy function e : R —^ R by 
Ix. gr 
e(y) = inf Эі I (s) ds; h e Н, у (Т; h) = y]. (5) 
2 1=1 0 


Since Ys = 0, this energy function satisfies e(x4) = 0. Let us define a flow $: 
[0, T] x RY — RY by 
d N 
= OPUS. ЕЮ хєК, (6) 
$(0, x) =x. 


Then the map $(t,-) : RY — RY, t є [0, T] is a diffeomorphism denoted by ф;. 
Note that ol (x) = x!. We define 


N 


m ag! 
йв,» Ser oe ууа, фа, у, 1<ї<М, isksd, 0) 


j=l 
which is the push-forward of the vector field V by the map ¢;. Let us define 
(61-ум : [0, T] x RF > R by 


d 
“ылу Vi VG. 1süjsN. 
К=1 


From (3), the matrix ( gl )i<i,j<N is positive definite corresponding to the Riemaniann 
metric on К^. We define the generating operator L;, t € [0, T] by 


LS do n T SPEI 
(Lif) = 5 agence» + > Ved, 


fet? xeR”, repr] (8) 


where Vi € CE ([0, T] x К^; R^) is given by 
a 1 N d 829i 
Vit = 5 >, >, эт HOE Vint HE Vint фа, у), 1xix М. (9) 
k,l=1m=1 


Let us define linear operators V : Ср([0, T] x RY) > Ср (0, T] x К^) and 
Г: С2([0, T] x RY) 8 C; (0, T] x RY) ^ Cz (RI) by 
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(10) 


(ууа, х) = Yat x) [3 
A Т 9 
rf, g(x) = У eie of Е ) " E) dt. (11) 


i,j=l И 


Our main result is: 


Theorem 1 There is a constant ro > 0 satisfying the following (1) and (2). 
(1) The energy function e € C?’ (fixé — го, a + ro]) and there is a constant Co > 0 
such that the asymptotic expansion of energy e satisfies 


lew) [5-0 xD? 2 (у x» + ( 8 + elo | 


< Со|у—х0?, yelxg—ro.xg+70], (12) 


x 


where 
Е 11 3 т 11 
b, = g (t,xo)dt, № = 7 (Vg )(t, xo)dt, (13) 
0 0 
T 1 T 
af (Vg! (r, xo)dt + >| P(g"! g!!) Qr). 
0 0 


(2) There are constants C1, C2 > 0 such that the probability density рь(у) satisfies 
the following: 


(072) exp( 02) pelo) ay) — &axG)|s eC), y € Lx} — то, ху rol 
(14) 


Here, ао and a» are continuous functions which satisfy 


de(y)\ 2 L(y — хд)? 
00 pup F idi (ы) < Caly — х0], y € [xà — ro. xà + rol. 
(15) 
and 
1 L 502 3b 
1 2 

= І 16 
200 = Te 7a; 6p 18) (16) 

where 
L= Јетәр Lag (t, :)) (xo) du dt. (17) 
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Remark 1 We can restate our results (14) as the heat kernel expansion: 


1 
є(2л)!/? 


—e(y)/e? 


PeO) ~ е (aoCy) + е2а2(у) + O (e^). 


Next, we apply our results to the asymptotic expansion of call option values and 
their implied volatilities. We regard X E as the underlying of these options. Then the 
forward value of a call option of strike rate К and maturity Т is given by 


C.(T, К)  E(Xl(T) - K)), ee(0,1, K>0. 


We define smooth functions o, € Ср ([0, оо)), n = 0, by 


oo г? 
Qn(x) = f z” exp (-« — =) dz, x>0. (18) 
0 


Some properties of o, are given in Lemma 7. Since (12), we can define the following 
function q € Cu — ro, xè + ro]; R+) such that 


1 х dy 2 
e(x) = ЗГА 2] o [xd ro, XQ + ro]. (19) 


Then the asymptotic expansion of call option values are given by the following. 


Theorem 2 There are constants Ко < K, and C, such that the value of the call 
option with strike rate K, maturity T satisfies 


)Ra(e, K)| < Cie", 


K J2«K) 
Vin exp yes, Eye eay Og (Kg (O 


єє (0, 1], K € [Ko, Ki], 


where 


Ro(e, К) = eq(K) (е 3 eu) qa 2e (K)/6) 


ao(K) © 24007 e 2600/8) 
E | е 04000) а'(К) о 2200) 

2а(К) ^" a(K)q(K) 6Nq(0/ 3 4(К) 
x BALTIO 20K) 
ei 2e(K)/e) © ao(K)’ 


(20) 
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Next we calculate the asymptotic expansion of implied volatilities of call options. 
Let us define f € C? (R4; R+) by 


1 ( x? 
ex 
„2л P 2 


We can easily check that f is strictly decreasing and 


1 
Р(х) = 7 eo. x > 0. (21) 


ХО) = оо, floc) =0. 


Therefore the inverse function f^! : R} — R, is well defined. When we consider 
the following normal model: 


dX(t) 2odW(t), X(0) = xi, 
the value of the call option with strike rate К and maturity Т is given by 


1 $ a 1 К-х) 
(ex) - K)* exp 72 dx = (К-х): f( Um ), 


Therefore the implied normal volatility сап be written as 


Су(Т, К) = 


1 оо 
М2ло?Т L 


1 
K — xg 


J ORT, EC АП" 


o (T, К) = K > xè. 


The asymptotic expansion of the implied normal volatilities are given by the 

following. 

Theorem 3 The asymptotic expansion of implied normal volatilities are given by 

| = — x 
A/2e(K)T 


where 


уй, Eye exp(J)| < С(є++|К — xl’, Kel kik (22) 


IK xj /L 105 1b VK) 2 L 55 3b J2e(K) 
a a т 


= b? 2 6b 4b e bi 65 "Ab Б 
o e к= 2% зы) V2e(K)\ , zr 23 эз) VK) 
"Jh b ' 302 4 є BWN 2b? 26 Е ` 


Remark 2 Since we can give the same formula for put options, Theorem 3 still 
holds in the case K < ae The implied volatility for a put option of strike rate K and 
maturity T is the same as the implied volatility for a call option with the same strike 
rate and maturity due to the put-call parity. See Appendix 3 for the details. 
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2 Hamilton Equation and the Energy of Path 


In this section, we investigate the correspondence between the Hamilton equation 
and the energy of path defined by (5). Without loss of generality, we can assume 
Т = 1. Let H be a separable real Hilbert space defined by 


d pl 
H- L € Co([0, 1]; R?) : his absolutely continuous and > | | qa) pat « o]. 
: 0 
i-l 


The inner product is given by 


d pn 
(h, Юн = >, hi Ok dt. 
i=l 


This Hilbert space H is called the Cameron-Martin space. 
Let y(t; h), t € [0,1], ^ € H, be the solution to the ordinary differential 
equation: 


d 
d " . 
—y (t; h) = У Vie. ye DAO, 1<ї< М, tel0.1], 
dt 4 


y(0; h) = xo, xo € R”. 


Let (8)1<г„у<м : [0, 1] x RY > R be given by 
d . 
8! (t, x) = У Via, x) Vf (t,x). 
k=1 
We define Hamilitonian J£ : [0,1] x RY x RN > R by 


N 
1 ü 
HE (t,x, p) = poen (23) 


Then the correspondence between Hamilton equation and the energy of path is given 
by the following. 


Proposition 1 Let Ji : [0,1] x H — R be the solution to the following ordinary 
differential equation: 
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V " are NA 
r t, Y; h))J; (5 h)h (t), 
X 


d NUR 
i (4. - 
слою = 525 
k=l r=! 
Ji(0;h) = bj, 1<i, j <N, 


where буу is Kronecker's delta. Let J(t; h) = J7! (t; h). We assume there is пеН 


and à є RY such that! 
N 


һо = $ Dy (1; ho). Q4) 
К=1 


We define x, p € С®([0, 1]; RN) by 
x(t) = y(t; ho), 


pit) = E par Ј @ ho) ECA; hoje (25) 


Then (x, p) satisfies the Hamilton equation: 


d og 0 
FS (t) = m ш p(t)), 


д 
дх! 
x(0) = xo, xo € К". 


© pilt) = - Ht, x(t), pO), 0<ғ<1, 1Si=N, (26) 


Furthermore, we have X = p(1) and 


N 


но = Ур) (х0), O<t<l, 1<k<d, 
i=1 
Wo? = У / ea seopio pid. ол) 
i,j=l 


Proof We note that J : [0, 1] x Н — К satisfies the following ordinary differential 
equation: 


а N 
d ð -~ =; : 
L Jih) = us — Vit, y(t; hb) Jis n)h* (0, 
TUGA 2.2.20 K (E, YE; AI; (t; hh qn 
Л(0; h) = ёр, 1<i, j <N. 


From Proposition 6.6 in Shigekawa [15], we have 


! We define Dy(-; РК] = d у(; А + ek). 


de 
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d N 1 | 
ру! (1; h)Ik = 2, Ji; h) | Jr G5 h)V] а, усп) да, 1«i x М, 
sil 0 


(28) 
From (25), it is easy to see A = p(1). Since ho = bye м Dy! (ho), we see that 


(ho, k) = P» T pi (t) Vi @, y(t; ћо) (oat. 


i=l /=1 


Therefore we have (27). We can check that (x(t), p(t)), O < t < 1, satisfies (26) as 
follows: 


^ а) = У Via. xq») = S eie, х@))ру@), 
j=l 
N 


“ pilt) = = E A 26 Ee xO) POH) = — У; ка 


К=1 ee у= 


j (0p). 


Remark 3 We will give a remark on condition (24). We define an energy function 
Е: R“ > Ras 


. 1 
Е(у) = ЕДЕ he H, yh) = у 


and let ho € Н be the minimizer of the energy function. Then we can apply 
Lagrange's method and there is a A є R such that 


N 
ho = > Dy (0; ho), 
К=1 


which is the condition (24). In particular, the condition (29) in the next proposition 
is corresponding to the energy function (5). 


Let us define the following notations. 


fe Stim 001809 _ 0 к>0, fgeC(O0,1D. 


In the following case, we obtain the asymptotic solutions. 
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Proposition 2 Letx(t; w), p(t; w) be the solution to the Hamilton equation (26) with 


w= (5 (i21, weR 56) 


0 2zxizN), 


under the boundary condition x (0) = xo, р(1) = А. Then the asymptotic expansion 
of x! (1; w) is given as follows: 


x (l; w) y xo + biw + baw? + byw", (30) 


where bi, b», b3 are defined by (13). 


Proof 'The solution can be written as 


Noo 
хи) 2x) Y. | 87 (s, x(s; w)) pj (5; w)ds, (31) 
j=l 


1 L gir 
т@; w) = pid; w) + 5 >, / gi Х®› wp s) w)pr(s; w)ds. (32) 
uer 


We calculate the asymptotic expansion inductively. Since х (t; 0) = xo, p(t; 0) = 0, 
we have 


x(t; w) " хо, p(t; м) e. 0. (33) 


Since the integral term in (32) is of the second order in w and from the boundary 
condition (29), we have the first order expansion of p: 


P TP _ w (i—1) 
pi(t; w) à pam= fy (22<i<N). (34) 


We substitute (34) for (31), we have the first order expansion of x: 
x! (t; w) " x + (fo gi! (s, xo)ds)w. (35) 


Substituting (34) for (32), we have the second order expansion of p: 


1 N 1 a jr 
pi (t; w) E pii w) + 5 pat) aug OOS w))ds) pj t: w) pr(1; w) 


11 


1, [' ag 2 
= nai (f Cr (s, xo)ds)w . 
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We substitute (35) for (31). Then we have the second order expansion of x: 
| | N о 1 1 8g! 
x! (t; w) 2 x4 2 gÏ (s, x (s; v»[»;d: w)+ ш 2 0 xodr)w? as 


N 
sf e (s, xo)ds)w DUE T 
«sf [f ic m ш, хо) du ds) w?, 


From the second order expansion of p and the first order expansion of x, we have 
third order expansion of p: 


Л (и, xo)duds 


1 | gll Р 
pi(t; we paw s(f “E (s, xo)ds)w 
N 1 ggil 1411 
1 87 0g 
F 51 Z r (s, vo(f aur t xo)du)ds 


j=! 
1 2. M 5 
0^g jl 3 
«f Cr Umm Эрт (/ g (u, x0)du) ds} w 


Finally we have the following third order expansion of x: 


x (t; w) = x + (| gil(s, xo)ds) w 
N t 5 дв!! 
p / zm 
«f [ els, хо) 28 

1 kl 1 11 
+ » Hi ss, x0)( | " (u, vo(f A (r, xo)dr)du) ds 


jk=1 


+ aA s'eo С А (f eer, xo)dr) du) ds 
T Ji d (5, хо) (f “ш xo)du) (| gor, xo)dr) ds 

+ TE L аа v([ ечи o( f га (r, xo)dr )du)ds 
f ае eo([ are f g^ (r, xo)dr)du) ds 


Ш (и, xo)duds 


а 06 xo)duds) w? 
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1 t 92g! $ is 
L „= ДӘ _., jl kl 3 
+5 | x6 n e (u, xo)du) ( n e Cr, xo)dr }ds|w 


From the definition of the linear operator V given in (10), we have 


xl (1; w) 3 xj + biw + bow? + bzw’. 


3 Proof of Theorem 1 


3.1 Proof of Theorem 1 (1) 


Let X. be defined by X; (t) = $(—t, Xe (t)). Then X satisfies the following stochastic 
differential equation: 


d 
dXi(t) = е у Vit, Xe) dW а) + e Vit, Xe()dt, 1<ї< М, te [0,1], 


Xe (0) = xo, (36) 


where V is defined as (7) and (9). The solution to the associated ordinary differential 
equation y satisfies (37) in the next lemma. 


Lemma 1 Let y(t; h) : [O, 1] x Н — К, be the solution defined by (4). Let us define 
ў@ h) =t, yt; h), 1515 №, te[0,.1], 
then Y satisfies the ordinary differential equation: 


d 


Gh) = У Vie. ў; Р) О), 1<ї< М, тє[0,1]. (37) 
К=1 


Proof From the definition of ф given by (6), we have 


d 


-Vit фт, фа, уу) + У Vidi Cot, ф@, y) Vj E, p, у) = 0. 


j=l 


Therefore we have our lemma. 


Proof (Theorem 1(1)) Since yl = 0, we have y! (t; h) = y! (t; h), and the energy 
function can be defined as follows. 
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As spe LN 
e(x) = [xl [Sco ar : 510; h) sal 
k=1 


Therefore it is enough to prove the theorem for the driftless case, i.e. Vo = 0. 
Let ho be defined by 


ho(x) = argmin(e(h); h e H, yl: h) = x}. (38) 
We denote ho(x)(t) = ho(t, x). Then from non-degeneracy condition, there is an 
r > 0 such that ho(x) is unique in x € (xo — r, xo + r). Using Lagrange multiplier 
theorem, we have 


ho(x) = XQ) DF! (0, ho(x)), (39) 


where А : (xo — r, xo + г) —> К is a smooth function. Applying Proposition 2, we 
have 


x! (1; AQ) — (si + bial) + bà (x)? + b) = O(Ix — хој). 


Therefore we have the following asymptotic expansion of À in x: 


A(x) F cı(x — xd) + co(x — a + сз(х — хф?, (40) 
where 
1 b2 b3 b5 
q=—-,aQ= , Сз = +2 41 
Ib oap HW тъ SM 
From [13] we have 
de(x) 
Ах) = (42) 
Ox 
Since e(x4) = 0, we can calculate the path of energy by 
Е cI 12,€ 13, 63 1\4 
е(х) = |, ACy)dy ^ 5 (x = хо) + 3 (x — xg) + 4€ = X9). 
Therefore we have Theorem 1 (1). 
Let us define a : [0, 1] = R by 
t ~ 
a(t) = alf Vl (и; хой). (43) 
0 


Then we have the following. 
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Corollary 1 Let ho € H be the element defined in (38), then we have 
Або) — «C)G — xpllg = Ox — xol’). 


Proof From (27) and the proof of Theorem 1 (1), we have 
Noon 
кт, x) = х piu; w) Vi и, x(u; w))dt 
i=1 


~ (/ Wu (и; хой)» Pi (/ yt (и; хой) — xd). 


1 


3.2 Proof of Theorem 1 (2) 


In this section, we will use the same notations as in [12, 13]. Let (©, || - ||o) bea 
separable Banach space and (Н, || - || p) be a separable Hilbert space such that Н is 
a dense subspace of © and the inclusion map is continuous. Let us, s є [0, оо), be 
the (necessarily unique) probability measure on (©, о) with the property that 


|. expl Еи, Ө)]д, (4) = expC- 5 Iul, u € Ө”. 


We can rewrite (36) replacing &? by s : 


d 
dXi(t,0) = У Vi (t, Xs(t,0))d0* (r) + sVA(t, Xs(t,0)dt, 1i М, tel[0,1], 
k=1 
Х,(0) = xo. (44) 


Here we replaced X and V in (36) by X and V respectively for simplicity. 
Let us define Wiener functionals F! : (0, 1) x Ө x [х0 — p, x) +ro] > В, 1< 
i € N, by 


F'(5,0, y) = Xi (1,0) — y. (45) 


The main theorem in [13] is summarized in Appendix 2. To apply Theorem 7, it is 
necessary to check the assumptions (A-1), ..., (A-5) in Appendix 2. Since f = 0, 
we can check (A-1). Since A(0) — 0, we can check (A-2), (A-3) and (A-4) in the 
neighborhood of origin. Since the ellipticity condition at origin, we can check (A-5), 
using the same discussion given in Appendix B in [14]. Then we have the following. 
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For each (s, у) є (0, 1] x [—7o, ro], the density function p; (y) satisfies 


(у) 


[ exp(—) ps(y) — ag(y)| < Kos!?^, (s, y) є (0, 1] x [—ro, rol. 


The function ао € C([—ro, ro]) is given by 


dely) 
ду? 


(у) 


1 
д 
deu — BOD? exp( S A F'O, holy). у). 46) 


а0(у) = ( 
Here .& is called the heat operator defined by 
д 1 
f £60) = [É + Ztracen D? fV, Ө), 
$ 


апа 


д 
В(у) = о pe ! (0, ho), y). (47) 


In this section, we calculate each terms in right hand side of (46) explicitly. First we 
calculate the heat operator. 


Lemma 2 There are constants C > 0 andr > 0 such that 


| F 0, holy), у) = oy me [ Уи, xo) Vig! (e, xo)dudt 


+ 5 > Vi (t, x9) V? ned s! tu, xo)du) ar} | 


i,j=1 k=1 


= О(1у = х0), у> х0. 


Proof Since the adaptivity of X, we have 


d pl 1 
аға 0 = У | AV; (a, X, (us уйбу и) + | Vj (и, Ху (и, 0))du 
К=1 


1 
+s f ef[Vg(u, Xs(u,0)]du, 1<i<N. 
0 
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Therefore we have 


« F! (0, ho(y), y) 


N d 1 . . 
=> >. | Vj Vj (и, Xo(u, hous y))) A X) (и, holu; yah (и; y)du 
j=1 k=1 
1 N d 1 | | . 
5252. "i У? Vi (и, Хо(и, һо(и; y) (Хоби), Р.Х GO)h (и; y)du. 
i,j=1 k=1 


Then using Corollary 1, we have the following. 


N d 1 f 
|а F! (0, ho), у) — o-3»(X | Vj Vi (и, Xo(u; 0)) Z XQ (u; 0jå* (u)du 
j k=1 


j-l 


1 N d 1 | ч | 
+ PA V? Vg (и, Xo(u; 0)) (РХ (u; 0), DX) (u; ове (u)du)| 


= O(ly — х4), 


where / XÅ (t; 0) = [o Vd (и, Хоби; 0))du. 


Lemma 3 The Hilbert-Schmidt norm of D? F! is given by 
N d l pt 
|D°F' (0,0, xo) lls =2 9. У; n n 3! (u, xo) Vi, Vin (t, xo) Vi, Vin (t, xo) du dt. 
һ,„=1т=1 0/0 
Proof 'The Malliavin derivatives of Ж, 1 <i < М, to the direction k € Н is 


given by 


N d ц 
DX; hik = >. > | Vi V} (и, Xo(u; h)) DX} (u; ЦКА" (u)du 


1=1 m=1 


d 1 
+ х Vi (и, Хоби; h))E" (u)du. 


т=1 


General Asymptotics of Wiener Functionals and Application to Implied Volatilities 153 
The second Malliavin derivative of F! to the direction ky, k2 € Н is given by 


D? F (0, 0, хо) К] 


>A Vi V] (u, xo) DX4(u; О) J£? (u)du 


1=1 т=1 


+f АА (и, x9) DXQ(u; O)[ko J£?" (u)du 
0 


N 

E > ra ViVi, (t, x0) ( [ у; u, хоиш) k" (аг 
P 

E? 3 [ fo vi (t, xo) V}, (u, xo)li>u 


l=1 mı,m2=1 


+ Vi V, (и, xo) Vs, (t, x0) 1а и)" (DKS? (и) du dt. 
Therefore we can calculate the Hilbert-Schmidt norm of D? F! as follows: 


IPE 6 0, MIL 


E 3 T [i (УГУЛ (t, xo) V (и, хо) 


1=1 m,,m2=1 


+ У xo) V, (t, x0) lr<u)” du dt 


iy > f fhe xo) V}, (u, xo))? du dt 


1=1 mi,mo-l 


-2 У » [ [ Va V], (t, xo) V, (и, xo) Vi, V], (E, xo) VIP, (и, xo) du dt 


Ц, ng 1m,,m2=1 


=2 5 318 [ g^ (u, xy) Vi, V1 (t, xo) Vi, V] (t, xo) du dt. 


lj,l2—1 m=1 


Finally we will complete the proof of Theorem 1. 


Proof (Theorem 1 (2)) Using (46), we have 


де x 


ы 


1 
"оу LEO). + 5 Ios ( 775 


1 
logag(y) = — 5 Іов (дес (н — В(у))) + 


In the right hand side, the asymptotic expansion of second term is given by Lemma 2, 
so we will give the asymptotic expansion of the first term. 
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Since B is defined by (47) and E TO (y = x9), we have 


|B) — ci D^ F (0,0, xo) — x9)| = Oly — xo^). 
Since B(x}) = 0, if |y — xi] is sufficiently small, we have 
E 
detz (7 — B(y)) = ехр(— У -tracey (B(y)")). 
n 
n=2 
Therefore we have 


О „1\2 
OUT = 0(|у = xol’). 


(48) 


ID? F(0, 0, xo) ls 


log(det;(g — B(y))) + 


The Hilbert-Schmidt norm of D? Е is given by Lemma 3. Therefore we have 


(у= х0)? 
log ao) y Sy ЇР F' (0, 0, xo)llis 
1 
(y — х0) 1, (960) 
d (0. ho), ) + 5 los( 757) 


m - 2 
B S У У [ [ g"? (u, xo) Vi, V] (t, xo) Vi, V] (t, xo)dudt 


1, ,l2=1 m—1 


mnm 
(2b 2. "i [ Vj (и, xo) V ув! (t, xo)dudt 


= 2 
+32 = > x[ [+ m(t, 0) 12 (и, xo) Vi, 1 V4 (t, xo)dudt 


1, ,l2=1 m=1 


+ бн) 


From the definition of (8), we have 


(у= x4)" | 


1 
lo ы L(g! (t, - dudt + = log( 
sa); 25 O мето, чий + 5 loe( 775 


Then we have (15). 


General Asymptotics of Wiener Functionals and Application to Implied Volatilities 155 


Finally we calculate a2 (800, First WE give Hn asymptotic expansion of the density 
using Hermite polynomials. Let y — xd +e Jer Then the asymptotic expansion in 
£ up to the second order is given as follows: 


ez | edz 
De(y)dy = Pe (xd F va va 
1 ONMESNE TES 
~ (ag(x] + 4) + 200001) ex | 
ы Jer Y СМ2л d (5 (Fa) 3 (52) 
€^C3 РА 4 
—~ (—)") |а 
22) 
2 2 
c2 3 2f 2 6 C5 C3) 4 
= |1- elz — 3z) +e | z + z 
| dg i 18cj Ga 42) 
2 
[2 Зсз Lgi 2 2 1 
( 2c] ad * 2 ) + ax] Jecoaz 
— 1 c2 H 2 с5 Н, 2 сз C3 H. 
-| E 3/2 3(z) + € 18cj d E 4(z) 
1 
+ (8 y) + a Н6(0) (2 “imo 
2 " im 1 2с} 4c? 


cL 
—() mo] - ү >= | exp de, 


where H;, n € М, are Hermite polynomials e.g. 


Hy(x) = x? — 1, 

Нз(х) = = 3x, 

Ha(x) = x* — 6x? +3, 

Ho(x) = xf — 15y* + 45у2 — 15. 


Since p; is probability density, we have 


l= I pe (y)dy = ri ps (ez)edz. 


The orthogonality of Hermite polynomials implies 


[. HA, (z) —— 2-р) == n zl, 
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then we have 


2 2 
1 C$ C5 сз cL 
= Hg(0 Ні(0 Н (0). 
а2(х0) DS 6(0) ба ie! 4(0) — ( 5 ) H2(0) 


This completes the proof of Theorem | (2). 


The asymptotic expansion of the probability density in = using Hermite polynomials 
is given as follows. 


Corollary 2 For each z € R, let y = x} + NE € € (0, 1]. For any r > О, there 
is a constant C > 0 such that 


2222 esca - [1 — ee Hae) кк жү 
a) p PeO Von a(z 183 6(Z 2c) 4с? 46 


11 
+e (I=) (2) | <2C, єє (0,1, ze[-r.r]. 


4 Proof of Theorem 2 


First we prove the following theorem. 


Theorem 4 We assume ХІ(Т) has a density pe(y), y є Rand let 


а, (у) = (272°)! exp( Y yeR. 


We assume that there are constants N € N, Co > О and Ko > 0 such that 


N 
las) — У are(y)e**| < Coe?™t?, у € [х0, Ко], 
k=0 


and assume that the energy function e satisfies e'(x) > 0, x € (х0, Ko]. We define 


g:R— Rby 
2 


x 
е(80)) = 7. 


Since e is strictly increasing, g is well defined. Then there are constants K, < Ko 
and С, such that the value of the call option satisfies following: 


=i 
Vin exp Dyer, K) - ep (È Van K4(K) Ry. К) < Cue, 
e e (0,1], К є[хф,К\|]. 
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where 


5 Cnm (87 (KY) 9m (87 1 (O8) aes 


R К) = 
nle, К) coo(g-(K)) qi(g-(K)/&) 


п,т>0,п+т>1 
2n+4m+1<N 


Here ca, € C (R) is given by 


= м 1 а ук+! p d \m-k 
6500 = 2, E pim a во). (z-) А0), 


where 
Ак(х) = ax (gg (x), neN, x e[d, Kil. 


We prepare the following lemma for the proof of Theorem 4. 


Lemma 4 
Ao(xd) = 1. 


Proof Since 


j= 7 KTE эстип] _ ade 29533; 
ca (2л2)1/2 J E2 | 


we have 
1 


оо 2 
1 = aus] as(g(ey)) expC- 78 (ey)dy. 


—oo 
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(49) 


(50) 


(51) 


Since the right hand side is bounded, taking the limit of = | 0, we have 


ao(g(0))g' (0) = 1. 


Proof (Proof of Theorem 4) We can divide the value of a call option into two parts: 


C.(T, К) = C,(T, К) + Re(Ko), 


where 


1 1 
jon 


_ Ko Ko 
CT, к)= | (= Kyps()dy = f 0-5 
K K NE 


and 
Re(Ko) = EIX} (T) — K : X} (T) > Kol. 


)аг (y)dy, 
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Since e(g(x)) — a we have 


x2 
z2) (а(х) Go) dx. 


. g (Ko) loo 
GIU K) = LAN (g(x) — K0(5—35)* exit 


Let А„(х) = ae(g(x))g/(x) and Ke = 1(g^!(Ko) — g^! (K)). Putting x = ez + 
g (K), we have 


exp(4 xi )C«(T, K) 
Ke s 1 z2 zg (К) Z 
= 1 1 
= [| (g(ez + g^ (К)) c 2 А )Ae(ez + 87 (K))dz. 
We define a 
Äe n) = āe n (80) Œ) = У Arae”. 
We also define 
Cen(T, K) 
-lypey2 ГК 2 1 
g (К) e = 1 zg (К) 
= ex ( 5-02) | (g(ez + g^! (K)) – К) m exp( : ) 


х Ås n(ez + g (K))dz. 
Then there exist constants Су, C2 > 0 such that 


g (Ey 
262 ) 


exp( C, K) - Čen (T, K)| < ciim 


Since 


ez + a7 () — К)А, (ez +87! OO) 
- У обв) т < Саен), Ке xd, Ki. 
еа үсү 


we have 


———ф | = 
N т+ Е 


lexp(— 2t ONG K)— >, xig MK) trm 


п,т>0 
2п+т+1=№ 


< Веі, K € [xd, Ki]. 
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For any à > 0, we have 


К.(Ко) < EIX} (T); X1 (T) > Kol 
SEXY PROT) > Ko) 7. 


Therefore we have 


lim e? log Re(Ko) < lim е2(1 — ô) log P(X}(T) > Ko) = —(1 — 8)e(Ko). 
Е Е 


Note that e(Ko) > е(К |), we have 


lim sup e? log А. (Ко) < —e(K1). 
ERU 


The function q defined by (19) can be written as 


— o'{o7! E d -1 -1 
q(K) = g'(g o irr (K)) . 


Then we have our assertion. 


Finally we prove Theorem 2. 


Proof (Proof of Theorem 2) From the definition of А (=, К) given in (49), we have 


coi(g | (K)) eo (g ! (K)/6) 220208 1 UO) qx(g  (K)/&) 
co,o(g—!(K)) q1(g71 (K)/6) co,o(g-1 (K)) gi(g7!(K)/e) 
,cio(g | (K)/&) 
co,o(g- 1 (K)/&) 


К (е, К) = E 


The second and third derivatives of g at g^! (K) are given as follows: 
а вак) = aO (К) 
dk28 8 = quA )q , 


d? 
701808 К» = q(K)q (KY) + а(К)?а” (К). 
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Using the definition of cp given in (50), we can calculate соо, co,1. c1,0. со,2 explic- 
itly as follows: 


coo(g (K)) = ao(K)q(K)?, 
cio(g | (K)) = ax(K)q( K^, 


3 
co 1(g (K)) = ag(K)q( KP) + 540a O^ q (К), 


1 7 
co2(g (K)) = 549 00q (К )* + 2а0(К)4(К)?4' (К) + gae Og GO? g (К)? 


2 3 n" 
+ 3 40(K)q(K)'q (К). 


Then we have our theorem. 


5 Proof of Theorem 3 


First, we define smooth functions 0,, n є №, inductively by 


61(x) = ф(х), 
Өп+л (х) = —NOn (х) + Ө, (х) (х)х. (52) 


We define the function A : [0, 1] x R— R by 
h(t, y) = f tfo». (53) 


where f is defined by (21). The properties of Л are given in Lemma 9. Then we have 
the following. 


Proposition 3 The implied normal volatilities of call options are given as follows. 


А e(K — xa) 1+l(e,K) 1 g (K) А 
oy(T, К) = КТ ехр n, — MI), К > xg. 
Here 
Ke, К) = (1+ Re, K) +r(k)) - 1, 
where 
„2л exp SP) C (T, К) 
R(e, К) = Se 
€coo(2 ^ (K))g1(g7*(K)/e) 
and “1 E 
vcre & (K)coo(g (К)) 


(K — xd) 
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R and r satisfies the following respectively: 
|R, К) — Км(е, K)| < Ce", (54) 


and 
lim r(K) = 0. 
Кух) 


Proof From Theorem 4 and Lemma 7, we have (54). Using I’ Hospital’s rule, we 
have | i 
р #(Юсоо(в CO) 
im 1 = 
K|0 K — х0 


g(x4)a0(x9) = 1. 


By definition of R, we can rewrite the value of call option as 
Co(T, К) = f(g” !(К)/є)в  (K)eoo(g (К) + R(e, К). 
On the other hand, the value of call option under the normal model is given by 


1 
K — xg 


c T ; 


V =(К—х0)/( 


Therefore we have 


1 
К — xj 
оу Т 


x«i K 
Fí ) = (1+r(K))(1 + RE, oE) 


Using the definition of h given by (53) and Lemma 9, we have our assertion. 


Next we will give the asymptotic expansion of implied volatilities. 


Theorem 5 For any N € N, there is a constant C > 0 such that the asymptotic 
expansion of implied volatilities satisfies the following: 


uo. N n4l -1 
| GARY on юе) 


< С(є+|К -xIp"*l, K e[xd Kil. 


Here 
Iw (e, К) = (1+ Ry (e, K) o r(K)) — 1, (55) 


where 
8. (K)eoo(g  (K)) 


r(K) Ка] 


1. (56) 
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Proof Using Lemma 9, we have 


GET »)| 260.21. 


Therefore 


1+1,К) | -lK М pltiy(e,K) Ө 
| oana, E yai 2, «OL ya; 
1 t E a0 1 


n! 


1+I(e,K) 1 -1(K Hy (6 K) | -l(K 
/ eae, Юу D lo, E ај 
1 t € 1 t E 


ІЛ 


N 


5 [ее Ө„ (у) 
1 


+ "0 Эа (t — D'ar| 
[4 € 


n! 


ү 1 g !(K) 
1 п=0 
Cill(e, К) — In (e, К)| + Calw (e, KIN < Cle |K — xo)”. 


ІЛ 


Finally we prove Theorem 3. 


Lemma 5 The derivatives of q, ао, a» at xo are given as follows: 


1 Ф090) 2c d'o S 363 
Jet ЕЗ. HE qGQ) | 9 2c 
ap) 1588 ар (x4) 102 c2\2 Зсз 
аб) а’ aD | (2) И 
а2 (х1) Е 1 ( Д (с) =) 

ag (xp) mm 2 САС 4c)" 


q(x4) = E 


where cj (i = 1, 2, 3) are given by (41). 


Proof Since 


(сбой) = ух? 
e(g(x)) = 4x, 
and g'(x) > 0, the derivatives are given by 


x = e'(g(x)g (х), 

1 = "(EN Ох)? + ее (х) 8" (х), 

0 = e" (gG)yg' GO? + 3e" (а(х) GOg" GO) + e (Gr G)8" О), 

0 = e? (eG) 8 (х)? + 6e" (g(x))g! GO? g" (х) + 3e" (g(x) 8" (х)? 
+ Ae" (а (х)) g' Gg" (х) + e (iG)? (ох). 
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Furthermore, since 


1 2b» брз  12b2 
el xl = 0, е” xl =, e!" xl Bue eO xl = 6 2 
( 0) ( 0) bi ( 0) b3 ( 0) bt b> 


we have 


2b 
800) = bi, g"(0) = =—, &”(0) = 
3b, 


A 
6 


(9b, b3 — 82). 


Lemma 6 
|Ro(e, К) — R&(e, K)| < C(e + |K — xi)’, 


К) - r* (| < CIK — xP, 


where 


RÌ(e, K) = 


e(K —x)po, Sen?  3сз\ф›(#!(К)/) 
Ja [az al ) lalaa O 


ere co\2 1603193(g ! (K)/6) 
e Ge 2] um 


"E. d 

2 2 

ur | 
2 З хс 4c 


F 1 сз eL 
3 СІ 4 c| 2 


сі 


1 
g 
2 


c1 


and 


гО(Ку = | |x х1)2. 


Proof We will calculate each terms of R2 given by (20). From Lemma 7, the functions 
фә /ф\ and фз/фу are bounded above. Since the first term is О (=) and other terms are 
O (25); itis enough to calculate the first order of K in the first term and Oth order in 
the other terms. Using Lemma 5, we have 


Cui _ 
соох) 


and the first derivative is given by 


=j Hn K f K 2 Г / K / 
2 оц 0) сюр (е j 34"(K) | aol ц) 
dK co,o(g (К)) ao(K) ^а(К) 2q(K)  ag(K) q(K) 
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Using Lemma 5 again, we have 


cu(g (K) | (K-xD[o, 5(ecy 36 
coo(g I(K) 1. Je [az 1 ) ы} 
со,2(871(К)) " 1 E =): о) 

coo(g !(K)) 0 сі 2 2 2c,4 
cio(g  (K)) Е =| e (6), | 
со,0(8—1(К)) 0 ci 2 3 4c44 


СІ 


С1 


Ci 


We can calculate r (K) in the same way and we have our results. 


Proof (Proof of Theorem 1.3) Using (55) we have 


In(e, K) e: RÜ(e, К) +r°(K). 


Since RS and r? are of the second order in є, K, we have 


2 nil -1 і 
р ае е K) +r°(K))g1(2—— E: 


Hence we have our result. 


6 Examples 


In this section, we apply our results to some known models. 


6.1 Local Volatility Models 


We assume the following model. Let o : R — Ҝ be a smooth function whose 
derivatives of any order are bounded. Let A be continuous R.,-valued functions 
defined on [0, Т]. 


d X* (t) = eX(t)o (X* ())aW,, 
X*(0) = xo. 
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In this case we can solve the energy as follows: 


«7 xf. aes) 


0 


where Е 
A= | A^ (t)dt. 
0 


The minimum energy path h is given by 


ie С] T ind 
o» x(f =) | (5) s. 


Xo 
We can easily calculate the coefficients. 


3 
bi = 0(x9)) 4, № = zo (xo^o' (xo) A’, 


8 2 
bs = (то Goo G9? + то G9)" Q9) л, 
= 1 2-7 2 1 3 2 -1 = ES i ELS 
L= (5000) o (xo) + 39 Xo) © (x0) A 8 O)= AA (/ 2 


Then using Theorems 1 and 3 we can calculate the density function and implied 
normal volatilities. We illustrate some cases. 
Example 1 (СЕУ model) This is the case A(t) = о and 


o(x) = xÊ. 


Each terms are given by 


2 a. 48-1 2 682 
А= Т, bi=x A, by = Вх A. bs = Q^ – B + 4x0 А, 


В, ap- "EET? 
Lo 7) AS е0) = YR 


1-8. „1=Ё 
ACA) GED 
(B = 1). 


810) = | 
73 10805) 


Example 2 (Displaced diffusion) This is the case A(t) = o and 
0 (x) = qx + (1 — q)xo. 
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19% —— — ——À 


Implied normal vol 


—- Asymptotic Formula 
11% &- Analytic Formula 


1095 1 1 1 | | 1 1 
0.25 0.5 0.75 1 1.25 1.5 1.75 2 


Strike 


Fig.1 Implied volatility smile of displaced diffusion, asymptotic expansion versus analytic solution 
with хо = 1.0, 4 = 0.5, о = 0.15, Т = 10 


Each terms are given by (Fig. 1) 


8 1 
A= о?Т, b= 354, b, = =хда А?, Рз = 3704 A^, L= 304 4^, 


Hee О. dx _ qy + ((—4)% 
g 0)= = log 
VA Jx 4Х+(1—4)х дул хо 
r 1-g (VA 
е (у) = x 
A(qy + (1 — q)xo) 
Black-Scholes model is the case q = 1. We present a numerical results of the 


asymptotic expansion formula, comparing with analytical solution. 


6.2 SABR Model 
We investigate the following model which is called SABR model. 


d X*(t) = во (t)o (X* (r)) (oaW(t) + y 1 — р242(1)), 
da? (t) = eva’ (t)dW (t), 
X*(0) = xo, a° (0) = о. 


This model was investigated in Hagan and Woodward [6, 14]. The energy function 
was given in Hagan et al. [8] as follows. 
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Е J/1-2pt + —p-t * $40» 
e(y) umi 2p, T log ae 


1—p 22T 


where 


v [P dz 
o= ETE 


In Theorem 3.1 [14], we also gave the energy function by solving Hamilton equations. 
Then the parameters are given by (Fig.2) 


3 
b; = о2о(х0)2Т, Ь = 59 i0) a (aa (x0) + vp)T?, 
8 2 
by = (5060 Goo" Qu + або toa" (хо) + бро Gro) ^a Gu)? 
2 
+ 2v? p?c (xo)^o* + 30 (x0)*v?) T?. 


o?o (x9)? T? 


L = E (a? et G9 + о G9)" (9) + 4vpao' (x0) +07), 
E y/1-2pt(y) + tOY — p +60) 
„МТ 1-0 | 


We present a numerical results of the asymptotic expansion formula comparing with 
Monte Carlo simulation. Here we assume o (x) = x^. 


22% - 


20% 


18% 


16% 


Implied normal vol 


1496 


" —— Asymptotic Formula 
12% -&- Monte Carlo | | 


10% 1 1 1 1 | 
0.25 0.5 0.75 1 1.25 1.5 1.75 2 
Strike 


Fig. 2 Implied volatility smile of SABR model, asymptotic expansion versus Monte Carlo simu- 
lation with хо = 1, œ = 0.15, 820.5, v = 0.2, р 0.2, T = 10. 
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Appendix 1 


In this section, we investigate some properties of functions defined in Sect. 1. First 
we consider gn, n > 0 defined by (18). 


Гетта 7 The functions on have the following properties. 
(1) ф(х) > 0, x > 0. 
(2) lim, o x” + o, (x) = nl. 


Qn (x) 
3 
cd 


« oo, п> 1. 


Proof (1) is easy to check. We prove (2). Putting y = xz 


2 


PAN 
2x? 


со 2 со 
у Y ndy 1 | E 
= = —y— d 
n(x) J Pg a = el Ж, ехр(—у )dy 


Then we have m" 
lim x"*!g,(x) = n уе Уау = п!. 
X—00 0 


(3) is an easy consequence of (1) and (2). 
The following is easy to check. 
Lemma 8 The functions {фп} satisfy the following recurrence relations. 
Qna 1(X) = =x Pn (x) + пфа-1(х), 
G(X) = —9n41Q). 


Example 3 qj (0 <i < 3) are given as follows: 


x? оо 2 
ex) = exp) / expC- 5d 


g(x) = —xgo(x) + 1, 
ф(х) = (х2 + 1)фо(х) — x, 
g(x) = —(x3 + 3x)go(x) + x? +2. 


Next we consider the function h є C??([0, 1] x R+) defined by (53). 


Lemma 9 The n-times differentiation of log h(t, y) with respect to t is given as 
follows. We define 0 in (52). 
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д 1 
(ә) log ht. у) = ah, у)), гє[0,1], у> 0, 
where 0, € Cp[0, со], n > 1 are given inductively as follows: 


A(x) = ф(х), 
On41 (2) = пб„(х) + Ө, 00610). 


Proof In the case n = 1, since f (A(t, y)) = tf (y), we have 


ah ie f (h(t, у)) 
at’ tf(h(t, y)) 
Since i 
Pieces Dues c. жей 
x gi(x) 
we have F(x) D 
E x) 2 p2 x) ү L 
өх) = ысу =(1+х°+х су) T0 


It is easy to check that 0; € Сь([0, oo]) and xO; (x) € Сь([0, оо]). We have 


a 1 
—logh = -0ı(h Я 
ap 98 (t, y) : 1h (t, y)) 


Since 
90,1 1 ; 
ay (gn On (Ut, у))) = ут (“Mn (h(t, y) + Ө, (A(t, YVA Al, y)h(, у)), 


it is easy to prove our lemma. 


Appendix 2 


In this section, we summarize the main theorem in Kusuoka and Osajima [13]. See 
[13] for the definitions. 

Let f, g € (07; R) and F € 4*9 (s/; К^) be completely P-regular functions 
and Y be a compact subset in R. We assume the following. 
(А1) There is an о > 0 such that 


sup slog( exp EEPL (ap < оо. 


se(0,1] 2 
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We define е: RY — [—oo, оо] by 


_. HAI? | Е N 
e(x) = inf{—— — fO0,h): F0,h) =x}, xeR”. 


We also assume the following. 
(A2) For each y € Y, 


М(у) = {h € H; F(0, h) = у} + 0 


and that 


e(y) 


h 2 
= POT = у, у) 


for precisely one h(y) € M(y). 

We assume moreover the following. 
(АЗ) Т(у) = DF(0, h(y)) has rank N for every y є Y. 

Let z(y) = T(y)*(T(y)T(y)*)7!T(y), y € Y. z(y) is an orthogonal projection 
in H. Let t(y)- = Ig — л(у). Then л(у)- is also an orthogonal projection in H 
onto ker T(y).LetV(y) : Hx Н — R bea bilinear form given by 


VQ), №) 
= D? f (0, h(y)) (y) h, n (y) th’) 
+ (hy) — DFO, hO), TOŽ TOT OT! D? FO, hy) Gh, rhy. 


We assume the following furthermore. 
(A4) For all y € Y and h € H \ {0} 


Vo). h) < |А. 
Finally we define 


A(s,0) = DF(s,0)DF(s,0)* 
= ((DF; (s, 0), DFj(s,0)) H)1&i, j€N 


and assume the following. 
(A5) For any p € [1, оо) 


Tim tog¢ | | det A(s, )|-? us(d0)) € 0. 
540 o 


Then Kusuoka-Stroock [12] proved the following. 
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Theorem 6 For each s € (0, 1], a signed measure P,(-) on RY given by 


Р,(Г) = | g(s, 8) exp (£2) us (06), P € (RP), 
F(s, ayer 


S 


admits a smooth density ps(-) with respect to Lebesgue ’s measure. Moreover, there 
exist sequence {an} у € С(Ү; R) and {К„}?° 9 € (0, оо) with the property that, 
for every n € N, 


n 
Ол Pes py 0) — > "ан Cy) SRO", (s, y) € (0, 1] x Y. 
m=0 
The main theorem in Kusuoka-Osajima [13] is the following. 
Theorem 7 e is smooth in the neighborhood of Y and 


N 
a А 
абу) = (det V^e()) "^ det; — BO)" exp (^ a OF (0, A (y) 


1=1 ! 
+ fO, hO) 
for y € Y, where 
N 


a | 
В(у) = У, 3, 00D O,AO) + D'f(0,h), | ye. 


ї=1 : 
Here we identify a continuous symmetric bilinear form B : Н x Н —> R with a 
bounded symmetric linear operator B : H — H given by 
(Bh, k)y = Bih,k),  hke H, 


and det, is a Carleman-Fredholm determinant (c.f. Dunford and Schwartz [5] 
pp.1106). 


Appendix 3 


In this section, we discuss about the implied volatilities for the case K < a. We 
define the forward value of a put option of strike rate K and maturity T by 


РЬ(Т, К) = Е[(К — X1(T))*] 
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Since we have put-call parity, the implied volatility of the put option is the same as 
the implied volatility of a call option with strike rate K and maturity Т. Since 


Р,(Т, К) = Е[(—Х}(Т) - (-K))4] = El(-(X} (T) — xD) — (—(K – x)))41 


It is enough to discuss in the case 3i = 0. 
_ Letx = (xl,..., x") e R”. We denote x = (—x!,x?,..., х"). We define 
Х.(1) = X(t). Then we have 


d 
аХ!@) = У eVit, X aw) Via, XeO)dt, 1i М, 
К=1 
where 
КЕ =V}, X) (0 xkzxd) 
V? (t, x) = | 
Vig (П<к<а,]#1). 
Since the associated Riemaniann metric gï (t,x) = е, vi (t,x) V, (t, x) is 
given by 


glia, x = gla, x), g'(nx) = 61, х) G0, gl), x) = gtt, x) i, j # 1), 


we have 


b=), b;—-—b, з=, L=L. 


Therefore Theorems 1 and 3 still hold for К < Хо 
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Implied Volatility of Basket Options 
at Extreme Strikes 


Archil Gulisashvili and Peter Tankov 


Abstract In the paper, we characterize the asymptotic behavior of the implied 
volatility of a basket call option at large and small strikes in a variety of settings with 
increasing generality. First, we obtain an asymptotic formula with an error bound 
for the left wing of the implied volatility, under the assumption that the dynamics 
of asset prices are described by the multidimensional Black-Scholes model. Next, 
we find the leading term of asymptotics of the implied volatility in the case where 
the asset prices follow the multidimensional Black-Scholes model with time change 
by an independent increasing stochastic process. Finally, we deal with a general 
situation in which the dependence between the assets is described by a given cop- 
ula function. In this setting, we obtain a model-free tail-wing formula that links the 
implied volatility to a special characteristic of the copula called the weak lower tail 
dependence function. 


Keywords Implied volatility asymptotics * Basket options + Index options * Large/ 
small strikes - Time change * Copula 
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1 Introduction 


In option markets, prices of vanilla call and put options are commonly quoted in terms 
of their implied volatility I(T, К), defined as the value of the volatility parameter 
which must be substituted into the Black-Scholes option pricing formula to obtain 
the quoted option price. Similarly, given a risk-neutral model, one can define the 
function (T, К) +» I(T, К) from the prices of vanilla options computed for that 
model. However, since in most stochastic asset price models the implied volatility 
function is not known explicitly, it becomes important to obtain efficient and accurate 
asymptotic approximations for it. Such approximations are useful for at least two 
reasons. First, they may shed light on the qualitative behavior of the implied volatility 
in the asset price model, and also on the effect of different model parameters on 
the shape of the model-generated implied volatility surface. Second, they allow to 
perform an approximate calibration of the model by comparing the market implied 
volatility with the asymptotic approximation. Such preliminary estimates can be 
used as intelligent guesses in the construction of a numerical calibration algorithm 
to accelerate its convergence. 

Approximations to the implied volatility have been studied by many authors in 
a variety of asymptotic regimes, both in specific models and in model-independent 
settings. One of the early references on the subject is the book by Lewis [31] dealing 
with stochastic volatility models. Various model-free formulas describing the wing 
behavior of the implied volatility were obtained in the last decade. To our knowledge, 
celebrated Lee's moment formulas were the first model-independent asymptotic for- 
mulas for the implied volatility at extreme strikes (see [30]). Lee's results were 
later refined by Benaim and Friz [8, 9] and Gulisashvili [22-24]. In Gao and Lee 
[19], higher order asymptotic formulas for the implied volatility at extreme strikes 
were found, and in Tehranchi [41], uniform estimates for the implied volatility are 
obtained. Small-time behavior of implied volatility is analyzed, among other papers, 
in [11] (їп local volatility models), [17] (for the Heston stochastic volatility model), 
[33] (for jump-diffusions), and in [2, 16, 34, 38] (for exponential Lévy models). 
Formulae for the implied volatility far from maturity are given in [18] (for the Hes- 
ton model) and [40] (model-independent). Finally, sharp price and implied volatility 
approximations for various models have been obtained as "expansions around the 
Black-Scholes model” in [10, 21]. 

Implied volatility is also quoted in the market for options on a basket of stocks. 
Note that the Black-Scholes formula can be applied to price a vanilla option by 
considering the entire basket (index) as a log-normal random variable. In particular, 
options on stock indices or major exchange traded funds are often liquid and quoted 
in terms of their implied volatility. Several studies [5, 13, 29] explore the relationship 
between the implied volatilities of index options and those of the constituents, with 
the aim of designing dispersion trading strategies. Another example is provided 
by swaptions, which are also quite liquid, often quoted in terms of their implied 
volatility, and can be interpreted as basket options on the underlying Libor rates 
[4, 37]. A tractable relationship between swaption and caplet implied volatilities 
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could be used to design a calibration procedure for the correlation structure of the 
Libor rates. 

In the above cases, finding reliable asymptotic approximations to the implied 
volatility can be even more important, since calculating the exact value numerically 
can be computationally very expensive due to the large dimension of the basket. 
Approximations based on the small-noise asymptotics in multidimensional local 
volatility models have been developed in [5] and more recently refined in [7], but in 
other asymptotic regimes, much less is known about multi-asset options, than in the 
single-asset case. 

Our main goal in the present paper is to characterize the asymptotic behavior of 
the implied volatility of a call option on a basket of stocks (with positive weights) 
for large and small strikes. Three different classes of multidimensional risk-neutral 
models with increasing generality are considered in the paper. In Sect. 3, we discuss 
the case of correlated log-normal assets, in other words, the assets which follow the 
multidimensional Black-Scholes model. Using a recent characterization of the tail 
behavior of sums of correlated log-normal random variables [27], we obtain a sharp 
asymptotic formula with error estimates for the implied volatility at small strikes. On 
the other hand, the asymptotics of the implied volatility at large strikes can be easily 
characterized using the results obtained in [3]. It turns out that for very large strikes, 
the implied volatility of a basket call option converges to the highest volatility among 
the stocks in the basket. 

Section4 deals with the case where the assets follow the multidimensional Black- 
Scholes model time-changed by an independent increasing stochastic process. It is 
assumed in this section that the marginal density of the time-change process decays 
at infinity like the function s к> s%e~® with a € R and > 0. The class of such 
models, includes standard multidimensional extensions of various exponential Lévy 
models, for instance, of the variance gamma model, the normal inverse Gaussian 
model, or the generalized hyperbolic model. These extensions were previously dis- 
cussed in, e.g., [15, 32, 36]. To our knowledge, for such a class of multidimensional 
models, the tail behavior of marginal distributions has not been studied before. In 
Sect. 4, we provide two-sided estimates for the distribution function of the asset price 
in the time-changed multidimensional Black-Scholes model, and use these estimates 
to find the leading term in the asymptotic expansion of the implied volatility. 

Finally, in Sect. 5, we deal with the case where the assets in the basket are corre- 
lated, and the dependence structure is described by a given copula function (we refer 
the reader to the book [12] for details on this modeling approach). Here we obtain 
an asymptotic formula that can be considered as a generalization to the multidimen- 
sional setting of one of the tail-wing formulae established in [9]. The new tail-wing 
formula uses a special characteristic of the copula called weak lower tail dependence 
function. 'This notion was recently introduced in [39]. 
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Remarks on the notation used in the paper 


Let f and g be functions defined on R, and let a € [—oo, oo]. Throughout the 
present paper, we write “f ~ g as x — a” provided that 


lim Хо) 
im 
xa g(x) 


= 1. 


We also use the notation “f < g as x — a" if 


Је) 2 


lim sup —— 
xa 9 ^ 


== 1, 


and write “ f (x) © g(x) as x — a" if there exist c; > О and c2 > 0 such that 


c1g(x) € f(x) < с29(х) 


for all x in some neighborhood of a. 
A positive function f defined in [a, oo) for some a > O is called regularly varying 
at infinity with index a € К if for any А > 0, 


NU T 
Fari f(x) 


for all о > 0. The class of all regularly varying functions with index o is denoted 
by Ra. The elements of the class Ro are called slowly varying functions. Regularly 
varying functions at zero can be defined similarly. 

The following set will be used in the paper: 


Ла: = (w € RT : ш> 0,i = 1,...,d, and Уш. 


Let ш € Ag. We set 


d 
E(w) := — > w; log wi, (1) 


with the convention x log x = 0 for x = 0. 
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2 Model-Free Formulae for the Implied Volatility 


Let X, be a non-negative martingale on a filtered probability space (Q, F, (7, }, о, 
IP). Consider a stochastic model where the process X models the price dynamics 
of an asset. Define the call and put pricing functions in the price model described 
above by 


C(T, К) = E(Xr — K)*] and P(T, К) =E[(K — Xr)*], (2) 


respectively. Here Т > 0 is the maturity, while К > 0 is the strike price. 
The implied volatility (T, К) |> I(T, К) is determined from the following 
equality: 
C(K,T) = Свѕ(Т, К,о = I(T, K)), 


where the symbol Сву stands for the Black-Scholes call pricing function. In the 
sequel, the maturity T will be fixed, and the implied volatility will be considered as 
a function of only the strike price. 

We will next formulate two model-free asymptotic formulas, characterizing the 
left-wing behavior of the implied volatility in terms of the put pricing function. These 
formulas will be needed below. Suppose the initial condition for the price process is 
Хо = 1. Suppose also that the asset price model does not have atoms at zero. The 
previous assumption means that P(X; = 0) = 0. Then the following asymptotic 
formula (a zero order formula for the implied volatility) holds: 


aos У® fe : lig lo _ о E T do lo = 
тү рк) 2 " Е року JTV Рк) 2 * BR) 


МЕ 
+0 (3) 


as К — 0. Here P isa positive function satisfying the condition P(K) ~ P(K) as 
К — 0. Formula (3) was established in [22] (see also Theorem 9.29 in [24]). The 
fact that the absence of atoms is a necessary condition for the validity of formula (3) 
was noticed in [14] (see also [25]). 
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The next asymptotic formula (a first-order formula for the implied volatility) can 
be easily deduced from the results formulated in [24, Sects. 9.6 and 9.9]: 


1 1 K 
= loglo + log B(K) 
JTV BOO 2 RU) S 


Z E Sun lo = + log B(K) 
JTV * PK) 2 ^ BEBE) 5 


K K -3 
+0 (= log PO (ioe zu] ) (4) 


vlog PR vlog FO | 


2./7,/log Pun 


as К — 0, where 


В(К) = (5) 


Formula (4) takes into account the results obtained in [19]. It provides more terms 
in the asymptotic expansion of the implied volatility at small strikes than formula 
(3) with P = P. More information on model free formulas for the implied volatility 
can be found in [24]. 


3 Basket Options in Multidimensional Black-Scholes Model 


Our goal in the present section is to characterize the asymptotic behavior of the 
implied volatility at small strikes in the case of a basket option of European style in 
the n-dimensional driftless Black-Scholes model. We assume that the interest rate is 
equal to zero. Let S |. , S” be a basket of assets such that 


log S; == log So — Сас + оз? уй, 
where S, = (SŁ, SOR So = (50, 3 $9); W 15 an n-dimensional standard 
Brownian motion, 93 is the covariance matrix, and diag(58) stands for the main 
diagonal of B. We denote by (A1, ..., An) € А, the weight vector associated with 
the assets in the basket. 

Consider the price process of the following form: 


n 
$ = У XS, г> 0. (6) 
i=l 
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The initial condition for the process S is given by So = У” № S and we will 
assume in the sequel that Si = l forall 1 < i < n. The previous condition implies 
that So = 1. Therefore, 


E >, ехр{ үр (7) 


where 


t 
- ез Team n isisa (8) 


In (8), the symbols 5j; stand for the elements of the matrix 982. We also set 


biit 
Ша = logi — >, l<i<n. (9) 


It is clear that the following equality holds: exp{ y ] ou yi, t>0,l<i<n. 


3.1 Asymptotics of Put Pricing Functions 
in Multidimensional Black-Scholes Model 


The distribution density of the random variable Sr will be denoted by pr. An asymp- 
totic formula for pr was recently established in [27]. Let us briefly recall the notation 
used in that paper. Let w € А, be the unique vector such that 


Bo = min wt Bw. (10) 
weAn 


The existence and uniqueness of w follows from the non-degeneracy of the matrix 
B. We let 


п :— Card (i = 1,. sw; #50}, I:—[(i-l,..n:wi; £0} := (k(D,..., kD}, 


i € R^ with jii; = Hro а and B є M; (R) with $i = = у, i jj. The inverse matrix 


of B is denoted by B- , and the elements and the row sums of 9З are denoted 
by aj; and Ay i= »XET aj, respectively. Since the variables Y1, ..., Y, in (7) are 
exchangeable, we can assume with no loss of generality that for the covariance matrix 
Ф, I = {1,..., n} with n < n. By the strict convexity of the objective function, the 


minimizer of min шш coincides with the first л components of w and therefore 
we€Ag 


belongs to the interior of the set RA. The minimizer over A; then coincides with the 
minimizer over the set (w € В: Уа ЧА 1 Wi = 1}, which means that 
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or, equivalently, 


А 
bec Es died s sg. (11) 


Mii А; | 


Since У) у A; > 0 (the matrix B7! is positive definite), this implies that Ак > 0 
fork — l,...,n. 

We will next formulate a condition under which the asymptotic formula for the 
density pr holds. 


Assumption (A) For every i € {1,...,n} \ I, (e! — w)+ Bw z 0, where е! є R” 
satisfies ej = lifi = jand e — 0 otherwise. 


Assumption (A) is a natural nondegeneracy condition for our problem. The following 
straightforward equality gives a relation between the optimization problem in (10) 
and a similar problem without the normalization constraint: 


2 


Р r | 1 
inf шш -r= inf —y By — 17v. (12) 
wedA,,r>0 2 veR”:v;>0,i=1,...,n 2 


A minimizer v of the right-hand side can therefore be constructed from the minimizer 
w of (10) as follows: 


А Ww 
v= ———. 
wt Bw 

Now, introducing the vector А є R? of Lagrange multipliers for the positivity con- 
straints on the right-hand side of (12), we get the Lagrangian jutu —1tv — Ае. 
At the extremum therefore, 8v = 1 + A, or in other words, 


Sw 

iiw | Pun 
Therefore, Assumption (A) simply states that for the constraints, which are saturated, 
the Lagrange multipliers are not equal to zero (since the constraints are inequalities, 
this is equivalent to the strict positivity for the multipliers). This is generally true, 
except when the solution of the unconstrained problem belongs to the boundary of 
the domain defined by the constraints. Assumption A is not restrictive and is satisfied 
in most applications. Note that if the row sums of the covariance matrix B satisfy 
Ai > 0,1 x i < п, then Assumption A holds. 
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It was established in [27] that under Assumption (A), the following asymptotic 
formula is valid for the density pr of the price Sr of the basket: 


1-а — Гета 
2 -1++ Whe Ak (vs аке gr) 


1 
рт(х) = Ст С 2) X 


=j 
exp [-z 6 toc Aj) log? L] ( + 0 ( (ке ~) )). (13) 


as х — 0, where the constant С is given by 


= 1 Ai o Ai 
тт || VÄ- Аң 


D. Aie. Aite tAn 
exp -a7 У a (o AH +) log — — + hir : 
i J 


(14) 


Cr 


Using formula (13), we can characterize the asymptotic behavior of the put pricing 
function P at small strikes. This can be done as follows. Consider the fractional 
integral of order two defined by 


Е›М(с) = Ге — с)М(т)ат, (15) 


where M is a positive function on (0, оо). Since 


K 
P(K)- n (K — x) pr Godx, 
0 
it is not hard to see that 
Р(К) = S-!FM(S), where S— K^! and М(у) = урт b. (16) 


Using (13), we get 
мо) = мо) (1+ О (02 71)) a7) 
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as у — оо, where 
1-я —2-T Ya Ag (10g 15" 414.7) 
Mi(y) = Ст (logy) ? y = A 


= = 
ехр = ++ da) og? y| SOY уро. (18) 


In (18), the constant Ст is given by (14). Set 
M36) = (ов у)! Му). (19) 
It follows from (17) that there exist c > 0 and y, > 0 such that 
IM)-MiQ)I < cM), у> у. (20) 


In [26], a general asymptotic formula was obtained for fractional integrals (see 
also Theorem 5.3 in [24]). We will next formulate this general result. Suppose 


M()-a(ye " {огай у> с 


where c > 0 is some number. Suppose also that the following conditions hold: 


1. у|а'(у)| € ya(y) for some у > 0 and all y > c. 
2. b(y) = B(log y), where B is a positive increasing function on (c, oo) such that 
В” (у) © 1 as y > oo. 


Then as с > оо, " 
Мс) 


pop + O((logo) !)). (21) 


FM (с) = 
The functions М = Mı and М = М» defined in (18) and (19) satisfy the 
conditions in the theorem formulated above. Applying this theorem, we obtain 


М (о) 
b'(oy? 


ЕМ; (о) = (1+ O((loga) !)) Q2) 
asc — oo, where i = 1, 2 and 


] - = 
bu) = 7 i +--+ Az) dog? u, Q3) 
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It follows from (19), (20), and (22) that 


F = u М! (с) =] 
2M(0) = F»Mi(o) + O(F2M2(0)) = vee + O(oggo) )) (24) 


as c — oo. Now, using (16), (23), and (24), we establish the following assertion. 


Theorem 1 Let P be the price of the put option defined in (2), and suppose Assump- 
tion (A) holds for the covariance matrix 98 (see [27]). Then, as К — 0, 


1 б] 1 62 2 1 1 —1 
P(K) = бо Esa (x) exp [stes Hd 1+0 (oe =) Я 


(25) 
where А 
Стт 3+п 
бо = z б=— 2 
(Ai +--+ + Аз) 
ja = А+: + А5 1l , 
в=-1- у DA (oe s). бз = gg (Ar cob Ая), 


and Ст is given by (14). 


Formula (25) will be used in the next subsection to characterize the left-wing 
behavior of the implied volatility associated with a basket option in the multidimen- 
sional Black-Scholes model. 


3.2 Left-Wing Asymptotic Behavior of the Implied 
Volatility Associated with Basket Options 


The next statement characterizes the asymptotic behavior of the implied volatility 
for small strikes. 


Theorem 2 Suppose Assumption (A) holds for the covariance matrix B. Then, as 
K — 0, 


1(К) = 


ao = Apud 
1 2 Dea Ak uc EE J 


= = = =з log 
VA, T o Аң 2(À1 +--+ Aj)? K 


айо logl = (1 p) «o ( ) (26) 
- — log log og og = 
А1 +--+ + Да)? K K Е 
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Remark 1 The leading term in the implied volatility expression above can also be 


written as | 
lim (К) = ——————— = / min wi$8y. (27) 
К {0 МА +: :: + А» weAg 


Formula (27) for the leading term of the implied volatility holds even if assumption 
(.А) is not satisfied—in this case, this formula can be obtained as a corollary of 
Theorem 9 of this paper. 


Proof It follows from (25) that as К — 0, 


1 : Ll : 6, 1091 : 62 | Edi 2 
o = 10 og log — — 62 log — og” — 
# (К) È So 8 BK T BK и 8 K 
ү! 
+0 log — 28 
(vez) ) M 
and 
1 =] : 61 log 1 : (62 + 1)1 l 
o = lo oglog — — og — 
EPK) E So 1 108 EK 2 EK 


1 тү! 
бз log? — log — 5 
+ 63 log 7 +( (We) ) (29) 


where бо, 61, 62, and ӧз are such as in Theorem 1. Moreover, the error term in (4) 
can be represented as follows: 


1 үз 
O | loglog K (vs x) : (30) 


We will next characterize the asymptotic behavior of log B(K) as К — 0. Denote 
the functions on the right-hand side of (28) and (29) by Vı (K) апа V2(K), respec- 
tively. Then, using (5), (28), and (29), we obtain 


1 
log B(K) = log —= +1 1 1 
og B(K) ены / 


Vi(K) — V2(K) 
Vi (K) 


It is easy to see that log(1 — /1—h) = log 4 + O(h) ash > 0. Put h = 
nOD. Then we have 
1 


1 Vi(K) — (К ү! 
кєк) = los e+ oe “ACO + о (ce) ) 


Implied Volatility of Basket Options at Extreme Strikes 187 


and hence 


1 1 тү! 
log B(K) = log ——— — log log — log — 1 
og B(K) SÉ rds EIL (ej) ) (31) 


as К — 0. 
Our next goal is to simplify formula (4) by taking into account (28), (29), and 
(31), and replacing the error term by the expression in (30). We can drop the terms 


O ((log P) in (28), (29), and (31), using the mean value theorem. This will 


introduce an error term O ((log р”) in the formula that follows from formula 
(4). Thus 


1 1 
AJmóy Е к 


rao = 32 по - br Vo(K) +10 
= VT 1 2 g v2 2 


/2 


= 1 = 1 
— — | V(K)— 5 108 V2(K) + log 


1 
Aji o C ^x 


Tu 
+ o( (ve x) ) 32) 


as К — 0, where Vi (K) and V2(K ) denote the functions on the right-hand side of 
(28) and (29), respectively, without the terms O ((log +) E Next, using the mean 


value theorem, we see that it is possible to replace V2(K) in the expression log Vo(K) 
in formula (32) by 63 log? К. Now, taking into account the definitions of Vj (K) and 
V2(K), we obtain 


J/2 3 1 1 xl 
RS — log | 44/70994 — (61 + 2)loglog z — 62 log = + 63 log x 


v2 1 бй. (01 + 2)logl | 62 + DI Lgs 84 
cT — log 00904 | — (01 oglog & — (02 о 3108 x 
1\2 
Ol [log — 33 
Ж (СОВ s 
as К — 0. Put 


е 
— log EI — (04 + 2) log log > — 62 log x 


hı(K) = 
1(K) dg 
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and 


3 
— log [eoi | — (бу + 2) loglog $ — (62 + D log у 
(К) = 


бз log? x 


It follows from (33) that 


„24/3 
Ут 


—2 
К) = eg [VOPR - VEF] + o ( (1e x) ) 


(34) 


as К — 0. Next, using the formula /1 + h = 1+ ih — gh? + O(h?) ash — O in 
(34), we get 


I(K) = : + ку (10 2E Bes log lo HC E 
Тз 463/21 03 ds 263./2T 3 ОАЕ 


1 —2 
+о((ь у) ) (35) 


as К — 0. Finally, plugging the values of бү, 62, and 63 given in Theorem 1 into 
formula (35), we obtain formula (26). 
This completes the proof of Theorem 2. 


Remark 2 (Implied volatility in the multidimensional Black-Scholes model for large 
strikes.) From Theorem 1 in [3], it follows that 


t log K — u}? 
PIS > j ntt «e| og K = 1) і 5 
A 27108 К 201 
where o? = тахџ „Эш, и = max, ,:93,=02 and m, = #{k : Зу = 


о2, ика = pj. From this result, we easily deduce that 


(log K — и)? 
ex 
log? K i 207t 


iG; — Куб А |. gos 


Applying Corollary 2.4 in [22] (which is nothing but the right-tail version of formula 
(3)), we conclude that 
(К) ) 


log К 


к) =0+0( 


as К — +00, where 1 is any function satisfying (К) — +оо as К — +00. 
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The function v/ can be removed from the error estimate in the previous formula, 
using Lemma 3.1, part 1, in [24]. The resulting formula is as follows: 


1 


as К — oo. 


Numerical illustration In this part of the paper we compare the theoretical left- 
tail limit of the implied volatility given by Formula (27) with the numerical values 
computed by Monte Carlo in the multidimensional Black-Scholes model. Figure 1 
plots the implied volatility of two basket call options as function of the strike price 
with 2-standard deviation confidence intervals (for 5 million paths), as well as the 
horizontal line corresponding to the theoretical limit. 

In the left graph, the basket contains two independent identical assets following 
the Black-Scholes model with volatility с = 0.3. In the right graph, the basket 
contains ten identical assets following the multidimensional Black-Scholes model, 
where the volatility of every component is с = 0.3 and the correlation between the 
log-prices of different components is p = 0.5. The maturity of the options is Т = 0.2 
years in both graphs. 

We observe that in both cases the volatility is almost constant as a function of 
strike (note the scale on the vertical axis), and for all strikes it is very close to the 
theoretical limit of Formula (27). We only show the zero-order term of the expansion 
in Theorem 2 because the higher-order terms do not lead to an improvement of the 
approximation for the strikes shown in the graph. Indeed, the higher-order terms in 
this expansion have a singularity at K — 1 and have a "reasonable" value only when 
log H is very small. 

For comparison, we also plot the implied volatility in the right wing in Fig.2. 
According to Remark 2, in the right wing, the implied volatility must converge to 
с = 0.3. However, from the graph in Fig. 2 we see that this convergence is very slow: 
for all strike values for which option prices may be computed without sophisticated 
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Fig. 1 Implied volatility of a basket call option in the multidimensional Black-Scholes model 
together with the theoretical 0-order approximation for the left wing. Left option on a basket of 2 
identical assets. Right option on a basket of 10 identical assets 
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Fig. 2 Right wing of the implied volatility of a basket call option in the two-dimensional Black- 
Scholes model together with the theoretical 0-order left-wing approximation 


variance reduction, the implied volatility, although it increases slightly with strike, 
remains close to its left-wing limit. 


3.3 The Case Where n 2 2 


The detailed discussion of the behavior of the distribution of the sum of two log- 
normal variables can be found in [20, 27]. The covariance matrix in this case is as 
follows: B = [bj], where bj = 07, biz = b21 = poo, by = 02 with сү > 0, 
o2 > 0, and the correlation coefficient satisfies —1 < р < 1. We will also assume 
70, > o2. Note that the case where p < A is a regular case, and Assumption (A) 
holds. In the case where p > md we have to rearrange the rows and the columns 
of B (see the example in Sect. 2.1 of [27]). Then B= (5), and Assumption (.A) 
holds. The case where p — a is exceptional. Here Assumption (A) does not hold. 
The following asymptotic formulas for the implied volatility follow from (26): 
e Suppose p > 22. Then 


2 
ү! тү? 
I(K) = 02 — o2 log № С x) +0 С x) (36) 


as К — 0. 
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e Suppose p « 22. Then 


gi 


T 3 oiT NP 
I(K) — Goo — соо 599 + log A1 — —5- — 1080 v 


TETE log(1 — 3) | à D)( 39 
+ | log A2 2 og v v оёт 


T , loglog 1 px 
> © DU + 0( (ve x) ) (37) 


K 


as К — 0, where 


ooy 1 — р? 


- 02(02 — роі) 
Соо = and v= 


2 2 d 


Therefore, the behavior of the implied volatility experiences a qualitative change 
(phase transition) at p* — са. Indeed, for р < р“, the expression in formula (37), 
approximating the left wing of the implied volatility, depends on the correlation coef- 
ficient, while for p > p* the left wing is approximated by a correlation-independent 
expression (see (36)). 

We will next discuss the asymptotic behavior of the implied volatility in the 
exceptional case where п = 2 and р = p*. The following formula holds for the 
distribution density pr in the exceptional case (see [20]): 


1 


са Гу ji 
рт(х) xx% С 3 i m С log 2) 
x x 
! [ (; JES ЕА ( G ) 
ехр og og log — — log [log [| — — 
2T (сї — o2) p? X p? 
1 2 
t loglog 3 Turn юл | 
log? 1 
exp } — - 38 
1 2То$ em 


as х — 0. Recall that we assume that jj = 0. Recall also that шіт and u2,r are 
defined in (9). 


Remark 3 Formula (38) can be derived from formula (B20) established at the end 
of the proof of part (1) of Theorem 2.3 in [20]. Note that in the present paper we 
assume с] > 02, while in [20], с < o». 
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Set 
1 1 
Үүт = log га — 1] + шт = m,r and № = log P —1]. (39) 


It is not hard to see using the mean value theorem that 


1 рү” 
log? (v + log log =) — (vs loglog 2) = 0(1) 
X X 


as x — 0. Hence 


Sap he (nen) ECL! 
exp 1 — — — — —- log* { V5 + log log — ~ exp 4 — —— — ———- | log log log — 
1 2Т(о? — 02) g 2; g Im р 27 (02 — 02) g log a 


as x — 0. In addition, 


1 1 1 

exp } ——5—— —5- | log log — | { log [| V» + log log — 
ines 1 1 1 
x | log – © ехр ү —— —3. { log log - | { log log log — 
x T(at — 05) X Я 


as х — 0. Therefore, (38) implies the following estimate for the density pr: 


x x 
ex log : ex — — (v lo >; 
Р 2То$ P 2T (c? — ©з) one x 
: ( 1001 3i 
exp 1 — ——5— ——5;- | log log log — 

B 2T (0? — 02) PORTS x 


1 1 1 
exp 1 ——— ——- | log log — } { log log log — 40 
Il g «Jf g log D) (40) 


as x — 0. 
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Our next goal is to obtain a two-sided estimate for the put pricing function P, 
by taking into account formula (40). We will use the ideas employed in the proof of 
Theorem 1. Let us set 


2 log? u (log log u)? 


+ 
2To3 27(02 — 02) 27(02— 02) Т(0? – 02) 


В(и) = (log и) log log u) 


and 
Ш.Т Ут Ут 1 


a(y) 2 y 7 (ову) "i (loglog y)" 17» * 
It is not hard to see that the restrictions, under which formula (21) 15 valid, are 
satisfied. In addition, for the function р(х) = B(log x), we have Р(х) ~ ох аѕ 


х — оо. Now, reasoning as in the proof of Theorem 1, we obtain the following 
formula: P(K) ғ P(K) as К — 0, where 


" 
_1—#2%Т 1 


1 uen 1 b -2 1 Ei 
~ То Т(07—05) Т(07—05) 
Р(К) = { 2 1 "ius 1 *2 1 1 tan 172 
i i) (ve i) (ve 79 
log? t 1 ( | | 

exp 4 — 2—4 } exp 1 -——-—-_ | log log — 

P| отог | P|) ar? — 03) (EEK 

l ( log | 31 

exp 1 ————, ——- [ log log log — 

P| 2r(1-eÓDVX 98 0 


1 1 1 
exp4 —_.——- { log log — log log log — 41 
[i g ex) ( g log 23] (41) 


as К — 0. Next, using (3) with Р given by (41), and making numerous simplifi- 
cations, we obtain the following asymptotic formula for the implied volatility in the 


exceptional case: 
jx 
IO) 2o 4 о( (е: x) ) (42) 


as К — 0. Comparing formula (42) with formulas (36) and (37), we see that the 
behavior of the implied volatility at the critical point p = e where the qualitative 
change happens, is similar to that in the case where p > e 
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4 Time-Changed Multidimensional Black-Scholes Model 


Recall that in Sect.3, we introduced the price process S for a basket of assets (see 
formula (6)). The present section deals with time changes in such processes. Suppose 
Tt, t > 0, is a non-negative non-decreasing stochastic process on (Q, F, {F }>0, Р) 
(a time change). Then, the time-changed process S has the following form: t +> S,,. 
We only consider time changes which are independent of the price process S. In the 
next subsections, two-sided estimates for marginal distribution functions of time- 
changed price processes such as above will be established. Moreover, the leading 
term in the asymptotic expansion of the implied volatility associated with a time- 
changed price process t +> S, in the n-dimensional Black-Scholes model will be 
found. 


4.1 Bounds on Distribution Functions 
of Sums of Log-Normal Mixtures 


The next assertion provides an upper bound for the distribution function of a random 
variable imitating the random variable S, for fixed t > 0. The additional drift vector 
jt will be needed later to ensure the martingale property. 


Theorem 3 (Upper bound) Let Y be a centered Gaussian vector with covariance 
matrix B = [bili<i,j<n, and let р € К" and р є К". Suppose Z is a random 
variable with values in (0, oo), which has a density p(x) satisfying p(s) < cse 9s 
for s > 1, where 0 > 0, c > О and a € R are constants. Then, there exists C > 0 
such that as k — +00, 


n 

Я А nm EN y —c*k 

PD eso HERR < e k] < Ске с е 
i=l 


where 


1,32 
(1+ tuc w) | (43) 


c* = min max (и + ЖОЙ: 


ї>0 WEAn 


Proof In this proof, C denotes a constant which may change from line to line. For 
k > 0, set 


n 
Е, (к) = P рэ eYi v kt+pikt+ fui < J | 


і=1 
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Fix w € An, and let t be such that 1 + tjj ^w > 0. Then, by Jensen's inequality, 


n 
P [> eri tik) < “| 


i=l 


n 
< э; + ktutw + йш + E(w) < E 


i=1 


" »( tthe a NALE 


м шикі 
z Cyt. | _ t t Eon] 
TJA 201580 


Сул | a ш | | oe | 
xp į —k exp 


= e 
d + tu bw) /k 2w-98wt 2w-98wkt 
"T E(w) + ftw pt w(£(w) + йш) 
i wtBwt wt Bw 
< су ехр p9 ia шш)” ехр ow 
^ Qc итш) 2w-98wt w-$98wt 
where €(w) is defined by (1). 
Consider the following function: 
(+ tutw)? 
F(t, = Ot + ————— 
бш) + 2wtBwt 


The following lemma establishes some properties of this function. The proof is given 
in the appendix. 


Lemma 1 There exists a unique couple (t, Ù), with t € (0, оо) and т € А, such 
that 


F(t, w) = min max F(t, ш). 
1>0 WEA, 


In addition, the function 


fO = F(t, w) 


has a unique minimum at the point t. 
We clearly have 1 + 7-0 > 0. Indeed, if 1 +f+w < 0 then ft < РФ) 


which contradicts the fact that 7 is the minimizer. If 1 + 70-0 = 0 then /'(7) = 0 
which also leads to a contradiction. Let 


= cie 


+00 otherwise, 
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Remark that if Т” < oo, then f(T’) = ӨТ” > f(t). Let us also choose Т small 


enough so that 


1 1 - 
1—l|u^twT- d 2). 
[А] 25 and с=т > /@) 


and assume that k is large enough so that К + 84w > 0. We bound the distribution 
function of the Gaussian mixture from above as follows: 


PLD ei Zim z Hii < eh] = ЕГЕ) 
i=] 


оо оо 
= n p(s) Fsjx(k)ds = К n p(tk) Е, (odt (44) 
0 0 
Т" a оо 
C (tk t S 
<k max F;(k)+k -CUI w Odt + ck ji eg "yer. 
0<г<Т МК + ш) T! 


(45) 


Now, by the choice of 7, the first term on the right-hand side of the last inequality 
in (45) satisfies | 
k max F,(k) x CA ke ?* 
0<г<Т 


with 8 > /(ї*). The second term is computed using Laplace’s method. Ask — +00, 
up to a constant, 


T CEOE до) р, ~ са а) 
е dt ~ Ск®е à 
т УК + pt)tw 
Finally, the last term is negligible by the choice of Т”. 
The proof of Theorem 3 is thus completed. 
Our next goal is to establish a lower estimate complementing the estimate in 
Theorem 3. Note that the estimates in Theorems 3 and 4 are off by the factor к". 


Theorem 4 (Lower bound) Let Y be a centered Gaussian vector with covariance 
matrix % and let р € К" and jj € R”. Let Z be a random variable with values in 
(0, оо), which has a density p(x) satisfying p(s) > cste— 9% for s > 1, where 0 > 0, 
с> O and o € Rare constants. Then, there exists C > 0 such that as k — +00, 


n 
РГУ eh Zenz eh < e *] > Chon ec. 
i=l 


where c* is given by (43). 
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Proof It is clear that 
i=l 


n er - 
аи < e > P[Y;Vkt + mikt + <—k — 1овп,і = 1,..., п]. 


By Proposition 3.2 in [28], the above probability can be bounded from below (very 
roughly) as follows: 


PLYiVkt + pikt + jjj < —k — logn, i=1,...,n] > ПЕТ ЕРУУ exp(—o;/2), 
where 
a= min xt Bly 


| 23 
XAR ((k4-log n)1+ktp+j2) 


T sik 1 ! : 
= max } –=и But+u UC log + kr + jo] 


ueR? | 2 kt 
(К +logn + ktutw + йш)? 
= max 
wed, 2w+Bwkt 
(14 tut wy? (1+ tut w)(logn + iw) (logn + йш)? 
< max k—————— + max + max ————————— 
wWE€Ag 2w-98wt weAn wtBwt wed, 2w+Bwkt 


Finally, we bound the distribution function of the Gaussian mixture from below as 
follows: 


IK 


n Е оо 
РГУ; efi Zi < e-k] = k f рК) Е, (аг > ck / (tke Е, (kdt 
А k 


i=l 0 i—1/ 
Ck(tk)®“ 
Sy ee NN 
“(A+R Jie 
C (tk)? 
- —— exp 
а+ка+т))у" 


= (1+ tu wy? 
—0tk — k —————— 
exp | max aper om 


Т а+А +2)" 


Otk — k max 


(1+ futwy? Ске 
wed, 2w-98wt 


Remark 4 Theorems 3 and 4 show that under their assumptions, the dominating 
factor describing the decay of the left tail of the price of a portfolio of assets is 
exponential with the decay rate equal to the constant c*. For example, for n — 1, we 


have 
(1+ gu — 20 + р2 + и 
5 А 
Oo 


202 I= 


c* = min(0t + 
1>0 
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In symmetric models with и = 0, the formula for c* simplifies to 


" 20 
minyeA, Wt Bw 


4.2 Implied Volatility Asymptotics 


Let S',..., S" be assets such that 
log S, = log So + üt + ur + 982 л. (46) 


where we use the same notation as in the beginning of Sect. 3. Let 5 denote the price 
process of the basket. Fix a maturity Т > 0, and suppose the random variable тт has 
a density pr. Suppose also that there exist cj > 0, c? > 0, 0 > 0 and a є R such 
that 


ciste” < pr(s) < caste, s >. (47) 
We assume that for every i = 1,..., n, 
D: 
0 > ui + 3 (48) 


This assumption implies that there exists = > 0 such that 


ELSE 55] < оо 


We then assume further that р; is chosen in such way that 


[150] = Si. (49) 


It follows from Theorems 3 and 4 that there exist Сү > 0, C2 > 0, and yo > 0 
such that 


a 


* ch ж 1 
Ciy* oe z) < PIS; < y] < Су“ Е z Oy «€ yo. (50) 
y y 


Since we have 


K 
P(K) =E|(K - 5.) | =] P[S;, < УМУ, 
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the estimates in (50) imply that there exist Сз > 0, Сд > 0, and Ko > 0 such that 


a 


x 1 a-n PY 1 
pu LE x| < Р(К) < СК + LE x| ‚ К<Ко. (51) 


Note that the put pricing pricing in (51) is squeezed between two regularly varying 
functions with the same index of regular variation at zero. Such estimates allow one 
to find the leading term in the asymptotic expansion of the implied volatility near 
Zero. 


Theorem 5 Suppose condition (47) holds for the time-change process т and that 
the assumptions (48) and (49) are satisfied. Then the following asymptotic formula 
holds for the implied volatility in time-changed n-dimensional Black-Scholes model: 


m d 
T °S к 


I(K)^ ( 
аз К — О, where the function w is defined by 


Wu) = 2 4(уи2 +и и), u>0 (52) 


and the constant c* is given by Formula (43). 


Proof 'Theorem 5 follows from (51) and Theorem 10.28 in [24]. 


Remark 5 Condition (47) holds for many processes commonly used as stochastic 
time changes, e.g., for the gamma process, the inverse Gaussian process, or the 
generalized inverse Gaussian process. The latter process is used as time change in 
the generalized hyperbolic Lévy model. Recall that the density of the gamma process 
is given by 


л“ 1—1„—Х. 
— ct— —AS А 53 
pi G) ion (53) 


while the density of the inverse Gaussian process is as follows: 


Pt (s) = 25 e^t mÀA-As—nc^r [s 
$ 


In the previous formulas, the symbols А and c stand for the parameters of the distri- 
butions. 


We close this section with a counterpart of Theorem 5 for the right tail, which can 
be deduced from Theorem 10 proved in the next section. 
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Theorem 6 Suppose condition (47) holds for the time-change process т and that the 
assumptions (48) and (49) are satisfied. Then the following asymptotic formula holds 
for the implied volatility in a time-changed n-dimensional Black-Scholes model: 


Nie 


T 


(К) ~ =) /log K 


as K — +00, where 


| 209i + и = ш 
= min P 


i=1,...,n Bi 


min 
С 


Proof Let Gi(x) = P[log Si > x]. By Theorems 3 and 4, there exist constants C1 
and С» such that _ 
Сухе A > бү(х) Z Cox" "е 


4/2091: + u? — ui 


Bii 


as х > +оо, where 


GS 


Note that in the single-asset case Theorems 3 and 4 can also be applied to the right 
tail, by symmetry. It follows that 


Gi(x) ^ =сіх 


as x — +00, and the proof may be completed by applying Theorem 10. 


Numerical illustration In this part of the paper we illustrate the asymptotic result of 
Theorem 5 with a numerical example. Figure3 plots the squared implied volatility of 


x x Monte Carlo x x Monte Carlo 
о.09 — Theor. asymptote 1 0.09 — Theor. asymptote 
0.08 
0.07 [у 
ч 
E 
0.06 
0.05 
0.04 
0.03 п | | 0.03 j | | 
60 70 80 90 100 60 70 80 90 100 


K, log scale K, log scale 


Fig.3 Implied volatility of a basket call option together with the theoretical asymptote. Left option 
on a basket of 2 identical assets. Right option on a basket of 10 identical assets. The logarithms of 
asset prices follow the multidimensional variance gamma model 
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two basket call options computed by Monte Carlo as function of the strike price on 
logarithmic scale, as well as the theoretical asymptote with slope given by Theorem 
5. Note that Theorem 5 only provides the value of the limiting slope of the squared 
implied volatility. Therefore, the performance of the asymptotic results should be 
evaluated by comparing the slope of the wing of the smile with the slope of the 
straight line. The intercept of the straight line has been chosen to keep the straight line 
relatively close to the curve, solely for the purpose of visualisation. The confidence 
intervals for 5 million simulated paths are very tight and not shown on the graphs. 

We focus on the left wing of the smile since in the left wing the slope of the 
smile is correlation-dependent, and therefore can in principle be used to calibrate the 
correlation structure. Also, numerical experiments for the right wing (not presented 
in the paper) show that one needs to go much further into the tail to observe the 
asymptotic behavior predicted by Theorem 6. 

In this numerical illustration, the time change follows the variance gamma law 
(53) with А = 10 and c = 10. In the left graph, the basket contains two identical 
assets with price processes given by (46), where we take u = 0, Si = 50 fori = 1,2 
and the covariance matrix which satisfies B; = o2 with o = 0.3 for i = 1,2 and 
5 = 0 fori A j. In the right graph, the basket contains ten identical assets with 
price processes given by (46), where we take и = 0, Sọ = 10 fori = 1,..., 10 and 
the covariance matrix which satisfies B; = 02 for i = 1,...,10 and Bij = po 
with p = 0.5 fori з j. The maturity of the options is T = 0.2 years in both graphs. 


5 Assets with Dependence Structure Defined by a Copula 


A popular approach to pricing European style multi-asset options is to calibrate 
full-fledged models for marginal distributions of asset prices, and then use a copula 
function from a simple parametric family to model the dependence structure. This 
is because information about the marginal distributions can be extracted from the 
prices of single asset options, which are liquidly traded, but the market quotes offer 
very little information about the dependence. 


5.1 A Very Brief Primer on Copulas 


Recall that the copula of a random vector (X1, ..., Xn) is a function C : [0, 1]" — 
[0, 1], satisfying the following conditions: 


e dC is a positive measure in the sense of Lebesgue-Stieltjes integration. 
e C(uj,..., ug) = О when ик = О for at least one k. 
e C(u1,..., ug) = uy when и; = 1 forall i Æ k. 
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In addition, it is supposed that 
PIX <X1,...,Xn € XJ] = CPX: = xis PX, Xx Qu...x)€ К". 


А copula exists by Sklar's theorem and is uniquely defined in the case where ће 
marginal distributions of X1, ..., Х„ are continuous. We refer to [35] for more details 
on copulas. 

The Gaussian copula with correlation matrix R is the unique copula of any 
Gaussian vector with correlation matrix & and nonconstant components (it does 
not depend on the mean vector and on the variances of the components). 

Given a function ф : [0, 1] — [0, оо] which is continuous, strictly decreasing 
and such that its inverse 9^! is completely monotonic, the Archimedean copula with 
generator ф is defined by 


С(и\,..., Un) = à (Qui) +- + Ф(иһ)). 


Definition 1 The weak lower tail dependence function x(a, ..., ап) of a copula 
C is defined by 
( у= min; log и“ 
ee — lim ; 
ANH HT 0 log (иа. ue) 
provided that the limit exists and is finite for all о, ..., а» > 0 such that ag > 0 


for at least one К. 


We will next formulate several known assertions (see [39]). 


Theorem 7 Let X1, ..., Х be random variables with state space (0, оо), marginal 
distribution functions F\,..., Fa, and a copula C. Suppose that for every К = 
1,...,n, the function Fx is slowly varying at zero, and there exist constants nk, 
] < k <n, and a function Е such that 


log Fy(x) ~ log F(x), 1 kn. 


Suppose also that the copula C admits a weak lower tail dependence function x. 


Then, 
log P[X; +---+ X, <x] 1 


im - = 
х0 min; log P[X; < x] Xni Mn) 


Theorem 8 


e Assume that a copula function C has strong tail dependence in the left tail, meaning 
that the limit © 
з їйї CIEL 
и}0 и 


exists and satisfies AL > 0. Then, the weak lower tail dependence function of C 
satisfies X(01, ..., Qn) = 1. 
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e Let C be a Gaussian copula with correlation matrix R such that det R 4 0. Then, 


X(01,..., ап) = тах а; min шТ Уш, for all o4,..., аһ > 0, 
i weAn 
where the matrix У: has entries Xj; = -Ri 1<1, у) <п 
у матау’ Shjan 


e Let C be an Archimedean copula with a generator function ф such that log ф Г! is 
regularly varying at оо with index А > 0. Then, 


тах(о, ..., аһ) 


(a^ on 


x(a1,...,04) = 


5.2 Copulas and the Implied Volatility Asymptotics 


In this subsection, we study the left-wing behavior of the implied volatility associated 
with a basket call option. Recall that we denoted by (Yj,..., Y,) the vector of 
logarithmic returns of the risky assets, and by (A1, ..., An) the corresponding vector 
of weights. Let C be the copula of the vector (Y1, ..., Yn), and С; be the distribution 
function of Y; for i — 1,...,n. The implied volatility is considered in this section 
as a function k +> I(—k) of the variable —k, where k is the log-strike defined by 
k — log K. Thetail-wing formulas due to Benaim and Friz (see [9]) play an important 
role in the sequel. 


Theorem 9 Let o > 0, and assume that the following are true: 


e There exists = > 0 such that E[e~*"'] < oo, = 1,...,n. 
e Forevery 1 <i <n, the function k œ> — log С; (К), k > ko, belongs to the class 
Ra of regularly varying functions, and there exist positive constants 11, .. . , Nn 


and a function G such that 
log G;(—k) ^ ni log G(—k) ask > oo. (54) 


e The copula C admits a weak lower tail dependence function x. 


Then, 


Ld - ип 
I(—k) =~ ul logG(—k) maxim; | (55) 


k k — xn... Mn) 
as К — oo, where the function w is defined in (52). 


Proof The distribution function F; of the random variable A; S; is given by 


Е (х) = С; (log x — log A;). 


204 A. Gulisashvili and P. Tankov 


Since the function log G; is regularly varying at —oo, it is clear that log F; is slowly 
varying at zero and 


log F; (x) ^ log С; (log x) ~ ni log G(log x) 


as x — 0. It follows from Theorem 7 that 
max; 7; 


log F(x) ~ — — — — 
xni... -> Mn) 


log G(logx) asx — 0, 


where F is the distribution function of У,” у A; S;. Equivalently 


max; Ni 


log F(e^^) ~ 
xn. ts Mn) 


log G(—k) ask > oo, 


and hence 


log Е (е) logG(—k) тах; mi 
k k XM -- -> Mn) 


ask — oo. (56) 


It follows from the assumptions in Theorem 9 that log G(—k) € Ra ask — oo. 
Therefore log F(e~*) € Ra as well. Next, using the tail-wing formula of Benaim 
and Friz (see Theorem 2 in [9]), we obtain 


IKÊT | log F(e~*) 
k v | k 


| ask — oo. (57) 


We will need the following lemma. 


Lemma 2 Lety be the function defined by (52), and suppose рі and p» are positive 
functions on (0, со) such that 


pi(x) 


> 1 as x оо. (58) 
p2(x) 
Е (iG) 
pix 

vog "7 х — oo. (59) 


Proof It is not hard to see that for all и > 0, 


2 
(Ми +I + uy 


The equality in (60) describes the structure of the function 7 better than the original 
definition. 


y(u) = (60) 
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1 


Fix € > 0. Then, using (60) and the inequality 1 < y=, we get 
Ф((1—)и) < : < : 
— £)u) < UP ce 5 
a- 3 (Ju: va) (1 — є)(Ми + 1-4 Vu) 
- — 
= = и). 
-e(vutit+ Ju? 1—є 
Similarly 
1 
1 —— : 
v( + €)u) > 14:709 
Therefore, 


1 1 
TQ) < Y + £)u) < YA — ә)и) < — yu). (61) 
l+e 1—e 
It follows from (58) that for every є > 0 there exists х„ > 0 such that 
(1 — e)po(x) < р(х) < (1+ €)p2(x) 
for all x > xz. Since the function 7 decreases on (0, оо), we have 


WC + e)p2(x)) < V) < YA — e)p2(x)) 


for all x > xs. Now, using (61), we obtain 


1 1 
Tre 7 026) < YŒ) < Toe (269) 


for all x > xz, and (59) follows. 


Finally, it is not hard to see that (56), (57), and Lemma 2 imply (55). 

This completes the proof of Theorem 9. 

The next example shows that condition (54) does not prevent one from choosing 
different marginal laws for different components of the process (Yj, ..., Yn) as long 
as these laws have a similar tail behavior. 


Example 1 Letus consider the following multidimensional extension of the example 
given in Sect. 5.2 of [9]. We assume that for i = 1,...,7, the distribution of the 
random variable Y; is normal inverse Gaussian, more precisely, NIG(o;, Bi, Hi, д). 
It is also supposed that the parameters satisfy a; > || > 0 and 6; > 0. This means 
that the moment generating function of Y; is given by 


Mi(z) = exp ( |o? — 8? — J2 — (Bi zi +) . 
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We refer the reader to [6] for more details on the normal inverse Gaussian distribution. 
In particular, it follows that Y; has a density g; which satisfies the following condition: 


gi(K) ~ СПЦ 3e- o EK k > +оо, 


where C; is a constant. Using Theorem 2 in [9], we see that — log G;(—k) € Ra as 
К — +оо, and also 


log G;(—k) ^ — log gi(—k) ~ (Gi — oj)k, К — +оо. 


Therefore, the condition in (54) holds with А; = a; — fj; and G(k) = е^. 
Assuming that the dependence structure of (Yj,...,Y,) is described by the 
Gaussian copula with correlation matrix R, we see that 


I(—kyT 7 | 1 


Т 1. k > +00, (62) 


infwea, wt ow 
where the matrix & = [Xj] is such that 

Rij 
J (ai — Bi) (aj — B) 


Ly = 


In other words, the implied variance is asymptotically linear, with a correlation- 
dependent limiting slope, which is given by the right-hand side of (62). 


Remark 6 In this remark, we compare the asymptotic formulas for the implied 
volatility obtained in Sects.3 and 5 (see Theorems 2 and 9). The latter theorem 
is more general than the former one. It provides the leading term in the asymptotic 
expansion of the implied volatility under certain restrictions on the marginal distri- 
butions of log-returns and the corresponding copula, and applies to many special 
models. In the case of a Gaussian copula and log-normal marginal distributions, 
all the conditions in Theorem 9 are satisfied, and the leading term is equal to the 


constant (А! Tec Ал) 2, This follows from Theorem 9, the second equality іп 
formula (27), and the second statement in Theorem 8. The advantage of Theorem 9 
is its generality, while the disadvantage is that the asymptotic formula for the implied 
volatility contains only the leading term and no error estimate. On the other hand, 
Theorem 2 applies only to the case of Gaussian copula and lognormal margins under 
а not very restrictive condition (A), but provides a sharp asymptotic formula for the 
implied volatility with several terms and an error estimate. 


For the sake of completeness, we include a proposition that is a counterpart of 
Theorem 9 in the case of the right tail. This proposition turns out to be somewhat 
trivial: the leading order of the implied volatility is determined by a single component 
with the fattest tail, and it does not depend on the copula. Let us denote by G; the 
survival function of Y;, i.e., the function G;(x) = P[Y; > x]. 
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Theorem 10 Let o > 0, and suppose that the following assumptions hold: 


e There exists = > 0 such that E[e  * Yi] < oo for i = 1,...,n. 
e For each i = 1,...,n, the function k œ> — log G;(k) belongs to the class Ra at 
infinity. 
Then, 
I(kT 1 = 
n ~ ab | 1 P max log Л] ask + +оо. (63) 


Proof Set X; = viel. Then we get 


PIX +--- +X, > х] > max PP[ X; > х], 


1 


PIXI + + Xn > x] < PIB : Xi > 1 < у PIX; > >] < n max PIX; > 7]. 
I 


Since for each i, the function log G; is regularly varying at infinity, it follows that 
the function x — log P[X; > x] is slowly varying, and therefore, for x sufficiently 
large and any є > 0, 


max log PLX; > x/n] < (1 + =) max log P[X; > x]. 
t 1 


Finally, 
..  logP[Xi +--+ Xn 2 x] 
lim = 1, 
хэ+оо тах; log P[X; > x] 


and formula (63) follows from Theorem 1 in [9] with а similar proof to that of 
Theorem 9. 


Numerical illustration In this part of the paper we illustrate the asymptotic result of 
Theorem 9 with a numerical experiment. Figure 4 plots the squared implied volatility 
of two basket call options computed by Monte Carlo as function of the strike price on 
logarithmic scale, as well as the theoretical asymptote with slope given by Theorem 9. 
Once again, we focus on the left wing of the smile since the slope of the left wing 
depends on the correlation of the Gaussian copula while the slope of the right wing 
does not. 
In both graphs, the basket contains assets with price processes 


j i ju; TX! 
Sr = Spe T 
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х x Monte Carlo x x Monte Carlo 
0.09} — Theor. asymptote a 0.09 F — Theor. asymptote 


>, 
X 
0.07 Osse. 


60 70 80 90 100 60 70 80 90 100 
K, log scale K, log scale 


Fig. 4 Implied volatility of a basket call option together with the theoretical asymptote. Left option 
on a basket of 2 identical assets. Right option on a basket of 10 identical assets. The logarithms of 
asset prices follow the variance gamma model with dependence given by a Gaussian copula 


where Д; is chosen so that [51] = 5 and XÍ is the variance gamma process with 
characteristic function 


Ep ={1—икг!ш + E 


For the numerical illustration we take jjj = 0, c; = 0.3 and к; = 10 for all i. 

In the left graph, the basket contains two assets; we take Si = 50 fori = 1,2 
and assume that the assets are independent. In the right graph, the basket contains 
ten assets; we take Så = 10 for i = 1,...,10 and assume that the dependence 
structure of the assets is given by the Gaussian copula with correlation matrix R 
with elements Rj = p + (1 — p)1j=;, where we took р = 0.5. The maturity of the 
options is T = 0.2 years in both graphs. 

In the variance gamma model, similarly to Example 1, it can be shown (see e.g., 
[1]) that 

log С; (К) ^ (oj + Bji)k, k — —oo 


with coefficients a; and 3; given by 


2 
Hi Ki Hi 
ai=. |- t> aud B= 5. 
0; Oj 0j 


Therefore, the limiting behavior of the implied volatility in this model is also given by 
(62). We see that the numerical illustration agrees well with the theoretical prediction. 
Compared to the numerical example of Sect. 4, we see that the slope of the implied 
volatility is steeper in the multidimensional VG model than in the copula model. 
This happens because the multidimensional VG model introduces additional positive 
dependence between the assets through the common time change process. 


N 
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Proof of Lemma 1 


The function F satisfies 


XwtBwt 


F(t, w) = 0t + Awt (1 + ut) — 
(t, w) max{ + Аш (1+ pt) 5 


where 1 stands for the n-dimensional vector with all elements equal to 1. Therefore, 


max F(t, = max F(t 
mag (t, w) тах (f, и), 


with 
u- But 
2 


F(t,u) = (0t + u 0 + pt) — y 


Since for every t > 0, F (t, и) is strictly concave in и, there exists a unique u(t) € RA 
with u(t) Æ О such that F(t, и) = max,egn F (t, u). This in turn implies that ikete 
exists a unique w(t) such that F(t, w) = maxwea, F(t, w). It is also easy to see 


that u(t) depends continuously on f. _ 
Let f(t) = F(t, u(t)). We would like to show that f is differentiable in t and 


compute its derivative. u(t) may be characterized as follows: for i = 1,..., п 
[1 + ut — tBu(t)]; =O if u(t); > 0 (64) 
[1 + ut — t8u(t)]; <0 if u(t); = 0. (65) 
Let 7 (t) denote the set of indices i € {1,..., n} such that u(t); > 0, and, for a 


vector x € IR", let хуг) denote the subset of components of x with indices in / (f): 
Xr) = {xi : i є I(t)}. Furthermore, let 93), г) denote the submatrix of the 
covariance matrix, containing the elements bj with i € I (f) and j € I (t). Then, the 
vector u (t) satisfies 


- Lo | 
ито = Bios Turo. up =9, 


where the set I(t) contains the indices i € {1,...,} which are not in 7 (f). 
Now, fix t € (0, oo) and for t’ є (0, оо), define 


1 
v(r)rq) = 7 Bro, io uto. vp = 0 


First, assume that for all i such that u(t); = 0, either [1 + ut — tBu(t)]; < 0 (with 
strict inequality) or 
[1+ pt’ — 198200); = 0 
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for all t’ € (0, oo). We shall call this Assumption 1. Then we can find 6 > 0, such 
that for every t’ € (0, оо) with |r’ — t| < б, v(t’) satisfies the characterization (64) 
and (65). Therefore, v(t’) = u(t’). This means that 


= 1 = 
F(t’) = 0 + S UT ш” тоу лоу t+ ut). 
Therefore, f is differentiable at ¢ with first derivative given by 


doe NNUS MM, I uh Bo! =@— u0 a 
где — a Па) Prog ro + O ONO O = zO A- ш) 


(66) 
and second derivative 
cll а 1 1} 99-1 1 
Ў O= зто 8л: 
Now assume that there exists at least опе ; such that u(t); = 0 and [1 + pt — 


tBu(t)]; = 0, or, equivalently, 
[1+ ut' — 96v(1)]; = 0 


with г = t. The case when the above equality holds for all г' is covered by Assump- 
tion 1. Since the left-hand side is linear in ¢’, this means that for a given index set 
I (t) and for a given i, there exists only one t’ € (0, oo) which satisfies the above 
equality. Since the number of possible index sets is finite, we conclude that there is 
at most a finite number of elements t є (0, oo) which do not satisfy Assumption 
1. But then, we can conclude by continuity that f is strictly convex (which entails 
uniqueness of 7) and differentiable for ай ғ є (0, oo), with the derivative given by 
(66) or alternatively by 


1 (@( и)? 


"MOL 27 w(t) + Bw(t) ^ 29 (0 -98w(t) 


Comparing this with the derivative of f, which is easily computed, we see that at the 
point f, these derivatives coincide. Since this point is characterized by the first order 
condition f’(7) = 0, and the function f is strictly convex, f also attains its unique 
minumum аї f. 
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Abstract We consider a basket or spread option based on a multi-dimensional local 
volatility model. Bayer and Laurence (Commun. Pure. Appl. Math., 67(10), 2014, 
[5]) derived highly accurate analytic formulas for prices and implied volatilities of 
such options when the options are not at the money. We now extend these results to 
the ATM case. Moreover, we also derive similar formulas for the local volatility of 
the basket. 


Keywords Basket options * Spread options * Implied volatility * Asymptotic 
formulas • Heat kernel expansion 


2010 Mathematics Subject Classification Primary 91G60 - Secondary 91G20, 
58J90 


1 Introduction 
For a local volatility type model for a basket of stocks, whose forward prices are 


given by 


To the memory of Peter Laurence, who passed away unexpectedly during the final stage of the 
preparation of this manuscript. 


C. Bayer (BX) 
Weierstrass Institute, Mohrenstrasse 39, 10117 Berlin, Germany 
e-mail: christian.bayer @ wias-berlin.de 


P. Laurence 
Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, 
New York, NY 10012, USA 


P. Laurence 
Dipartimento di Matematica, Università di Roma 1 Piazzale Aldo Moro, 2, 
00185 Rome, Italy 


© Springer International Publishing Switzerland 2015 213 
P.K. Friz et al. (eds.), Large Deviations and Asymptotic Methods 

in Finance, Springer Proceedings in Mathematics & Statistics 110, 

DOI 10.1007/978-3-319-11605-1 7 


214 C. Bayer and P. Laurence 


dF;(t) = oi (F; (dW; (t), i —1,...,n, (1.1) 
d(W;, W;)(t) = pydt, i,j =1,...,n, (1.2) 


with a given correlation matrix p, we consider basket options with a payoff 


n En 
Р(Е) = (Soe - к) ; 


1=1 


where we generally denote іп bold face a vector of the corresponding italic com- 
ponents, as in Е = (Fi,..., Fa). Since we only assume that at least one of the 
weights w,,..., wj, is positive, we will refer to options of that type as generalized 
spread options. 

The purpose of this paper is to provide an explicit first order accurate short time 
expansion of the price Cg(Fo, К, T) of the above option using the heat kernel ex- 
pansion technique (see, for instance, [12, 13, 20]) when the option is at the money. 
Moreover, from the asymptotic formula for the option price we also obtain an asymp- 
totic formula for the implied and for the local volatility.! Thereby we complement 
the results obtained in [5], where a first order accurate asymptotic formula was given 
when the option is not at the money. (The zero order accurate formula is well-known, 
see, for instance [1]. When the option is not at the money, alternative first order ac- 
curate results can be found in [12].) Such asymptotic formulas are highly relevant, 
in particular when the dimension of the model is high (say n > 3), since then tradi- 
tional (simulation or PDE) techniques to compute Cg fail or are at least very time 
consuming. In fact, for a wide range of different parameters, [5] show numerically 
that their asymptotic formula is remarkably close to the true price as given by the 
model, even for not so small maturities Т (like 5 or even 10 years), for dimensions of 
up to n = 100 (or even more). The same holds true when the option is at the money, 
see Sect. 6. 

We now sketch the procedure for deriving the asymptotic formulas, highlighting 
the differences to the non-ATM case. 


e In the first step, we derive a Carr-Jarrow formula for the basket option price, 
separating the price into the intrinsic value of the option (which vanishes in the 
АТМ case) and an integral over the arrival manifold (^; w; F; = K} with respect 
to the transition density p (Fo, Е, Т). This is done in Sect. 2. 

e The first terms in the heat kernel expansion of p(Fo, F, T) are computed. In the 
non-ATM case, a zero-order heat kernel expansion was sufficient to get first order 
accurate formulas for the implied volatilities. At the money, we actually need to 
add one additional term in the heat kernel expansion. The heat kernel coefficients 
are computed in Lemma 3.6. 

e The aforementioned integral on the arrival manifold is essentially an integral with 
respect to the rapidly decaying kernel exp (—4(Fo, Е)2/ (2T)), where d denotes 


I Since we consider spread options here (for which © į Wi Fo,; may be negative), we derive implied 
volatilities both in the Black-Scholes and in the Bachelier sense. 
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the Riemannian (geodesic) distance induced by the stock price process. Hence, 
the integral can be approximated using Laplace’s expansion for Т — 0, which 
involves the minimizer F* of Е к> d(Fo, F)? subject to >”, w;F; = К. In the 
general case, this minimizer has to be computed numerically, while it is obviously 
given by F* — Fo when the option is at the money. On the other hand, the formulas 
are much longer and more complex due to the higher order heat kernel expansion 
used, see Proposition 3.4 together with Lemmas 3.3 and 3.7. 

e In Sect.4, we use the same Laplace's expansion technique to derive the local 
volatility of the basket, see Proposition 4.1. 

e Finally, in Sect. 5, an asymptotic expansion for the implied volatilities is computed 
by a comparison of coefficients between the asymptotic expansion of the basket 
price derived in Proposition 5.1 and asymptotic expansions of the Black-Scholes 
and Bachelier formulas, respectively, see Eqs. (5.2)- (5.4). 


An alternative way to derive the asymptotic expansion for at-the-money options 
would be to start from the non-at-the-money formulas and pass to the limit. This 
would involve un-determined terms «б», which would need to be resolved by the 
l Hopital rule. In particular, we would have to compute limits of derivatives of the 
optimal configuration, which are not known in closed form when the option is not 
at-the-money. Still, one could follow that approach using similar techniques as in 
[4], but the derivation would hardly be any simpler than directly starting from scratch 
again (the course of action chosen in this article). 

In Sect.6 we present numerical examples for one particular choice of a lo- 
cal volatility model, namely ће СЕУ model, corresponding to o;(F;) = &: F E Я 
0< ĝi < 1,1 <i < п. The numerical observations supports the claimed accuracy 
of the asymptotic price formulas. In fact, comparisons with highly accurate reference 
solutions show that the asymptotic formulas indeed have the suggested rates of con- 
vergence as Т — 0. Even more, they indicate that the formulas, in particular the first 
order formula, are highly accurate even for large maturities such as Т — 10 years, 
thereby confirming the observations in [5]. 


Remark 1.1 Inthe same spirit as [5], the aim of this paper is to give informal deriva- 
tions of fast and accurate asymptotic formulas. Indeed, there are several steps, in 
which our derivations are not fully rigorous. In particular, most local volatility mod- 
els (like the CEV model) exhibit singular behaviour at the boundary of the domain 
R^ which can inhibit the validity of the heat kernel expansion, and, a fortiori, also 
the Laplace expansion applied later. It is clearly possible to rigorously justify both 
expansions under appropriate (uniform) ellipticity assumptions (see, for instance, 
[20] for the validity of the heat kernel expansion and [6] for a rigorous version of 
Laplace's expansion). An extension to general or some specific local volatility mod- 
els, however, seems to be a difficult task, see also the comments in [5, Sect. 4], and, 
in particular, the results of [2]. Thus, we believe that a more “hands-on” approach 
can be justified in this particular case. For related problems see also [3, 8, 9]. 
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2 Basket Carr-Jarrow Formula 


Consider a basket В = У w; F; with weights w; € R. Following [5, 7], we are now 
going to derive a Carr-Jarrow formula for the price of a generalized spread option on 
the basket, 1.e., a decomposition of the price of the option into the intrinsic value and 
an integral over the arrival manifold {8 = К}. Take the Itó derivative of the basket’s 
price: 


d > wi FO) = У wici(Fi()dW;() 


i=l 1=1 


У wiwjoi Fi oj (Fj) ру dW(), 
i,j=1 


2 
TN,8 


for a new Brownian motion W. Here we have used the notation см, в to indicate the 
"normal volatility" of the basket which must not be confused with the lognormal 
(Black) volatility св = 7*5 used in reference [1]. Therefore, by the Itó-Tanaka 


n 
Wi F; 
=1 


i 


formula we have 
n + n 
d (> wi Fi(t) — к) =} wilg wrok dO 
ї=1 i=l 
1 2 
T 59t&:x: и F (D) K) £N, (F (t))dt. 
Integrating we obtain 
n "E n + n T 
(> w;Fi(T) — ? = (> w; F;(0) — x) + Èv f Lyr G)- k Еи) + 
i=] j—] j=l 0 
1 fT Р 
T 2o бүе(и):У wi Fja)- К) £N g (FG0)du. 
Letting 6x = {F € R^ : >) wj Е; = K} and taking conditional expectations with 


respect to the filtration Fo at time 0, we obtain, assuming Р; (7) is a martingale for 
;2 
each i^: 


2а many cases of interest, Е; (1) is only a local martingale and not a martingale. But the discrepancy 
is not "felt" for short times, since the set of paths that can reach the boundary have small probability, 
in this limit. This is known as the principle of “not feeling the boundary" for small times and is born 
out by our numerical results. More surprisingly the boundary is not felt, even for quite large times. 
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+ 
п 1 T 
Ca(Fo, K, T) = (> w; Е,(0) — к) n" ;] E [os бек (80 | dt. 
i=l 0 


Letting u(F) :— У" ,wiF; and recalling |Vu| = |w| (where | - | denotes the 
Euclidean norm) we can re-express this as 


n + 
Св(Ео, K, T) = (> и F; (0) — к) + 
ї=1 
11 rp $ 
*5ui], f Уи (Б) сх g(F)0o (и (Е) — K) p(Fo, F, t)dFdt. 
2|w| Jo JR | 


By the co-area formula (see [10]) 


f ivucoteeoas = f / g(x) Hy-1(dx)ds 
Q —oo J u-l((s]) 


(where Н„—1 denotes the Hausdorff measure on u^ ! ((s])), we arrive at 


n + 
Св(Ео, K, T) = (> w; F; (0) — к) + 
ї=1 


T оо 
Баа И óo(s — К) n c^, g(F) p(Fo, Е, г) H, i (dF)dsdt 
2 |w] Jo /—% es C 


n s 
= (Èuro — к) + 
ї=1 


1 (T1 
+ J E | Tk. F, D) Hi (dF)dt. 
2 Jo MF | 

К 


Note that У,у coincides with the (n — 1)-dimensional Lebesgue measure on бк. 


Proposition 2.1 The value of a call option on a basket B is given by 


n ab 
Св(Ео, K, T) = (> wi F;(0) — к) + 
i=l 


1 [7 1 : 
+ | — | У wwijosi(Foc;(Gpgp(Fo Fu) H,-i(dF)du. (2.1) 
2Jo |w] 2 

Ex #7 


Using the formula for the basket's local volatility, [1, 12], expressed in the notation 
introduced above, after canceling common factors we also have the 
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Proposition 2.2 The local volatility of the basket option is given by: 


n 
Г У, wiwjai(Fo; (Fj)pijp(Fo, Е, T) H, 1 (dF) 
&K) i, j=1 


2 2 

K,T)K? = 
gj (K. T) Г po. F, T) H, i (dF) 
&(K) 


3 A General Asymptotic Expansion Procedure 


The starting point is the basket Carr-Jarrow formula derived above for the calculation 
of the option prices as in Propositions 2.1 and 2.2 for the calculation of the local 
volatilities. The next step is to approximate the transition density there using the heat 
kernel. For reasons that will become clear in the course of the asymptotics, it will be 
necessary to use the so-called geometric expansion 


2 
p(Fo, F, t) = aap Viae 9^ (uo (Fo, F) + tui (Fo, F) + 0(1)). (3.1) 


n 
ті)? 


For a detailed exposition of the geometrical underpinning of (3.1) we refer to [5, 12, 
13, 17, 20]. Here, we just give a very quick reminder. 


Remark 3.1 We shall assume that the process Е, is locally elliptic in the sense that 
р is invertible and c; (F;) > 0 for any F; > 0 and any i, i.e., for Е in the interior of 
the domain of the process F;. A rigorous heat kernel expansion for locally elliptic 
diffusions is given in [2], with the restriction that the expansion is only valid for 
compact subsets of IR^. 


The state space IR^ is equipped with a Riemannian metric by defining the inverse 
g | of the metric tensor by 


8” (Е) 2oi(Fpjo;(F), 154, ј <n. 
Hence, the metric tensor itself is given by 
gj(F) = o;(F) pto, 151, ј п, 


with determinant 


det g(F) = det (57!) П ox Fe) 
К=1 


(where р? denotes the (i, j)-component of the inverse matrix o^! of the correla- 
tion matrix р). The (geodesic) distance between two points Fo and Е is denoted 
by d (Fo, F). 
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The specific form of these quantities in the setting of local volatility models has 
no relevance in our initial asymptotic derivations, which can be obtained for generic 
versions of these. So, to lighten the notation and streamline the presentation, we 
first derive the asymptotic expansions without any specific reference to these and 
then plug in the specific form only at the end of the process in order to produce the 
required concrete asymptotic expansions. 

Plugging the heat kernel expansion (3.1) into the expressions in Propositions 2.1 
and 2.2, respectively, we see that we have to compute expressions of the form 


1 d(Fo, F)? 
эл, WE) esp (2) H,— i (dF), (3.2) 


where 
W(F) = п; (Fo, F) :— ydet g (F)o5, g (F)u; (Fo. F), :=0,1, (3.3) 


for the option price and for the numerator in Proposition 2.2 and 
V(F) = f; (Fo, F) := ydet g(F)u; (Fo, F), i = 0,1 (3.4) 
for the denominator in Proposition 2.2. 
The integral on the n — 1 dimensional subspace € of R” can be transformed into 


an integral over R"-!, by eliminating one of the variables. We choose to eliminate 
the nth one, using the payoff 


1 п—1 
hs fa (к-а). (3.5) 
n і=1 


Denoting 


G= (Fi... Ел) € RE, 
п—1 
Ск = [cen : Sw = «| А 
1=1 


so that for our hyperplane's intersection 


п—1 
1 
ex ong = [ren Н e- (s z (s- > Т); 
Wn n 
i-l 


Note that the set Gx is introduced in order to ensure that Р, in (3.5) is non-negative, 
as it needs to be. The set 6x is an n — 1 dimensional hyperplane in К”. 

Note that, when we parametrize the hyperplane & using (Fi, ..., F4 1). as 
in (3.5) 
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Ек Pigs ng ba) = (Ё\,..., Ies Pa Pigs Ёл, R2) 
we will always assume that the weight multiplying F, is positive. This can always 


be achieved by choosing as the nth asset one of the assets with a positive weight. 
Then for the surface measure, we have 


Ww 
ан, 1 = V1 + |V F,|?dFı...dFa—1 = Mr ар. 
W 


Wal 
In this notation, with A = E the integral (3.2) reads 


1 |w] _ A(FgF x (6)) 
t 


(2тт)"/? [ул | 
G 


У(Ек(С))аЕ TN .d Fn—1 = 
K 
1 [wl 


Qa wal J * 
Ск 


Ф(6) 
т W(G)dG, 


(3.6) 


using the notation ®(G) := A(Fo, Ек (G)) and (by abuse of notation) Y (G) :— 
V (Ек (G)). We now use Laplace asymptotics for multiple integrals. The main con- 
tribution comes from a neighborhood of the minimum point. 


G* = arg min d? (Fo, (С, F,(G, K)), (3.7) 
С̧єСк 


= d’ (Fo, Ex). 
Set К® = (G*, F,(G*, K)). (Of course, when the option is at the money, we have 
G* = (Fo, s Fon=1)-) 


Order Zero. The zero-th order term in the Laplace expansion of 


Ф(С) 
/ e : W(G)dG 
Ск 


is identical to the one in [5] except that іп the present setting we have d (Fo, Fy) = 0. 
We get, as in [5] 


n=l 

"T zT 02 n= 207) Z 
tT W(G*) х | е > dzj...dz, = yan 2702 
R^-1 (det 0)? 


where О = D*®(G*) is the Hessian of Ф at the minimum point. Thus, bringing 
back the missing factor and taking into account that F} = Fo in the current (ATM) 
setting, we see that the lowest order term in the Laplace expansion of (3.2) is 
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v... |W І 


= ini ав t EO (3.8) 


Order One. For obtaining first order implied or local volatility terms in Ше АТМ 
regime, we need to push the Laplace expansion one step further, i.e., we need one 


additional term for 
$(G) 


/ ет W(G)dG 
Ск 
Hence, we apply the (multi-variate) Taylor expansion for ®(G) := A(Fo, Fx(G)) 


up to order 4 around the maximizer G*, which can be expressed in tensor notation 
as 


(G) = $(6*) + РФ(С*) (G — G*) + 15?e(6*) (G — С*)®? + 
LÁ 2 
=0 


+ ; D*eG*) (G – G*)*? + Dio’) (GG) ep, 


with 


ok 
k ek. "EPUM 
D* ®(x)y = > "5 dE o (x)yi, Yi 


Surg 


(This notation makes sense as any multi-linear map on a vector space—such as 
D*o (X)—corresponds to a linear map—here also denoted by D* ®(x)—on the ten- 
sor product space). Of course, we are aware that when the option is at the money, 
the optimal configuration is the same as the initial configuration Fo. Nonetheless, we 
think that using a different symbol for the optimal configuration at this stage leads 
to a clearer exposition of the underlying ideas. Likewise, we apply Taylor expansion 
up to second order for the map Y (С) around С“, 


1 
W(G) = WG") + DW(G*) (G — G*) + ;D^w(G*) (G – G)? +... 
In the end, we are interested in small-time asymptotics, so we change variables 


1 * 
к= ече 


so that we can express the above Taylor expansions as expansions in f, 


222 C. Bayer and P. Laurence 
1 1 * 1 2 *14,02 1 3 *) 73 
7 PG) = =Ф(6*) + ;D'6(G)77 + cD'e(G*)z Vt 


1 
+ zg De GO? + o(t), 


and 
V(G) = Ч(С*) + DW(G*)z/t + Í DW (G92? +o(t). 


Using the above Taylor expansions, the change of variables, and 


_ 2 
еам+ы =] ам + (5 + ) t + o(t), 
we obtain 


Fo.F (G " " 
n Ы 019" W(G)dG = 1"-0/2е7®6 ui e 1D D (Gz? 
Gx (Gx—G*)/Jt 


mI *1403 1f Los scu dou *j204 
d <р ®(G*)z 4l a Ф(С*)2 | a” $(G*)z*^ |: 
+ oi | x [ос + DW(G)zt + T DVG + oi | dz. (89) 


In the next step, we approximate the integral by replacing the domain of integration 
(Gx — G*)/A/t by R"-!. Then we can see that the integration kernel in (3.9) is 
Gaussian with vanishing mean, so that the integral of any odd monomial with respect 
to the kernel vanishes. Thus, we obtain the expansion 


I| |! ] _ MESE KG) "T 
[wal Отг)" ' — W(G)4G = [hy t hit oQ)]. 3.10 
wal Qut Je, * (G)dG = [hy +hyt+o@)] (3.10) 


with Л defined in (3.8) and 


v. Iw] 1 —1z"oz| 1 2 * e 1 3 *14,03 
һу : = мы бей fae 2 zD YGZ ер?Ф(С?)2 


2 
x DW(G*)z + ; СЧ V(G*) — Poe) a 
(3.11) 


Here we assume that V is polynomially bounded and Fo > 0 (i.e., all components 
of Fo are strictly positive. Indeed, under these assumptions we observe that the error 
in the approximation (3.10) decays, in fact, like e~!/’ by properties of the normal 
distribution. 

Using Isserlis’ Theorem (see [14]), the Eq. (3.11) for hr can be computed ex- 
plicitly. 
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Lemma 3.2 (Isserlis’ theorem for fourth and sixth moments) For a covariance 
matrix X € М4 let T?(X)) є (R4)8^ and T? (£) є (R)9 be the tensors defined 
by 

i4 = Xi, Mii, F Miis Xii, + Xi, Miis 


ay 


20 i6. = Xii Xii Xii + У Xii Уі + У Xie Zigis 
+ Miis Я Xii. + Xii Xi; Lisig + Xii Xie Zigis F Xii Xi Lisi 
+ У Diis izis + Ул 6 Уру Miis iiij iis 
+ 1; Ving Vigis H Эу із Vigis + Эв Vinig Diis + Viyig Vinis Vizig, 


1<i,...,i6 € d. For ~ МОО, X) we have 
E Gal =T(5), E [z2] = T). 


Hence, we can get an explicit formula also for AT in terms of derivatives of V 
and $—which are easy to compute, but lead to quite long formulas that depend on 
the individual choice of the local volatility model. These formulas are not included 
here, as they essentially boil down to exercises in the product rule. 


k 
Lemma 3.3 With the short-hand notation д; і, ‘= — we have 
i ik 


D^w(G5go^! 


рУ [м | 1 1 
и | J2nt det Q | 2 


з» 


These results are summarized in 


Proposition 3.4 Assume that we have a locally elliptic local volatility model such 
that the heat kernel expansion (3.1) holds, initial stock prices Fo are strictly pos- 
itive, and V is polynomially bounded. Moreover, we assume that the minimization 
problem (3.7) has a unique solution. Then we have the Laplace expansion 


2 
mu V(F) exp (-“® Н,_1(аЕ) = һу + thy + o(t) 
T EK 
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with he given in (3.8) and hr given in Lemma 3.3. 


Remark 3.5 We note that the key assumptions of Proposition 3.4 are not easy to 
verify in general. We refer to [5] for elements of a proof of the heat kernel expansion 
and to [3] for further discussion. 


The last ingredient needed for the asymptotic expansions of both implied and 
local volatilities are the heat kernel coefficients uo and иј. As we are assuming the 
options to be ATM, we only need the heat kernel coefficients on the diagonal. 


Lemma 3.6 For a local volatility model, we have the following formulas for the 
heat kernel coefficients on the diagonal: 


uo(F, Р) = 1, 
Bn EE. IFoo (Е: 
ШЕ, F) = 7» o0(F)or(F) - 3 У, oilo Qj), 
i=l 


i,j=l 


where, as usual, p denotes the (i, j)-component of p~'. 


Proof Note that the infinitesimal generator A of the process F(t) can be expressed 
(using the summation convention) as 


А, Leg 2 
N^ 57 дЕ, 


where 
1 


det g 


таа 


denotes the Laplace-Beltrami operator associated to g and the vector field f is given 
by 
fif) = (ЕЮ) (Б), i=1,...,0. 


As indicated in (3.1), the transition density of the process F(t) satisfies (under certain 
assumptions, see Proposition 3.4 and Remark 3.5) 


d(Fg.F)? 
p(Fo, F, T) — v det g(F)e 3T — (uo(Fo, F) + Ти (Fo, F) + 0(T)), 


1 
(2тТ)"/? 
where d (Fo, Е) is the geodesic distance between Fo апа Е and uo and иј are the heat 
kernel coefficients. 

The order zero heat kernel coefficient is given by uo(Fo, Е) = 4A(Fo, F) 
1 : 
eih +48) , where de ( f. Yg) is understood as integral along the geodesic y joining 
Fo and F and A(Fo, F) is the Van Vleck-De Witt determinant, 
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1 | 1 242 (Fo, 2) 


A(Fo, F) = 
(Fo, E) det е (Ер) det е (ЕЁ) 2 OFOF 


On the diagonal, we clearly have J ( f. Ae) — O and for any local volatility model we 
have A(Fo, F) = 1, as the geometry is isomorphic to the Euclidean geometry by the 
coordinate transformation F +> Ly, where LpL? = Id and y; := JE oi(u) аи. 
Hence, u(F, F) = 1. 

For the first order heat kernel coefficient, we refer to [16, Eq. (4.1)], where it is 
shown that 


1 Lc. 1 2 
и(Е,Е) = zr + 7 dive /(Е)— z Lf (DI; 
Неге, к denotes the scalar curvature, which vanishes for local volatility model due 
to the isomorphism with the Euclidean geometry already used above. (Note that [16] 
consider the heat kernel corresponding to A + f, whereas we consider the operator 
5A + 1 f. Hence, we evaluate the formula obtained in [16, Eq. (4.1)] at t/2 instead 
of t.) For the remaining terms we have 


| _ 1 pq. ee те 
dive f) = aaa өр; [Ede = oF (М), 
Lf (IZ = (Б) fi (E) f;(F) = о; Fl) o^ (Еу). п 


Finally, we can explicitly compute the determinant of the Hessian О of Ф at 
G* = (Fo, ..., Fo.n—1) in the ATM regime. 


Lemma 3.7 The Hessian О of ® satisfies 


У у=1 Wii Fo, piwjo j(Fo,j) 


det О = 
we det pIE- oi (Fo? 


= оўу g (Fo) det g(Fo)/w2. 


The proof of Lemma 3.7 is deferred to the Appendix. 


4 Basket Local Volatility 


The numerator in the right hand side of the formula in Proposition 2.2 is given by 


1 


d (Fo, F)? 
(2nty"/? 


| (био, P) + йг Fo. F) exp (- 
J&k 2t 


Jaia = 
һб Et Ui + ng) + o(t), 


where, by abuse of notation, we denote the function F +> и; by uj again, i = 0, 1. 
For the denominator, we get 
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1 


d (Fo, F)? 
Qnty'? 


| (dio Fo. F) + tá (Fo, Е)) exp ( 77 


) Ava) = 


ng + (i? +) + oc. 


а +bit+o(t) а | arbi — abo 
а + bot + o(t) a2 a; 


t + o(t), 


we arrive at 


h” pio h”? +h” — hp” Ж 4 pit 
Oloc(K, T? K? = о + o ti 0) о Ut 0) 
0 


К) 


AS по = сі, gio, we can easily simplify 


Т + о(Т). 


uo 
0 2 
=, Fo). 
jio N.:8C 0) 
0 


For the first order term, we note that all the terms he? and hi have the common 
factor 11. ттт , Which, hence, cancels out in the first order term—in particular, 


implying that the “first order term" is really first order in T. Thus, we get 


Proposition 4.1 For К = Fo = У” уи Ер, the basket local volatility has the 
asymptotic expansion OR AT, K)= PES o CK) + Ore (K)T + о(Т), with 


2 
on, g(Fo) 
2 N,8B 
Оос, CK) m К? , 
pio (go +h”) u jo (no + nity 
2 0 1 0 0 1 0 
Foc, CK) = aw) А 
(n) к? 


We recall the definition 


n 
ом.в(Е)? = У wioi(Fi)pywjoj(F)). 
i,j=l 


5 Implied Volatility 


The strategy for obtaining an asymptotic expansion for the implied volatility is as 
follows: we first compute an asymptotic expansion of the basket option price in our 
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local volatility model, then we compare coefficients with the short time expansion 

of the corresponding call option price in the Black-Scholes or Bachelier model, 

respectively. Hence, we first apply our general asymptotic expansion obtained in 

Proposition 3.4 to the Carr-Jarrow formula from Proposition 2.1, getting (for K = Fo) 
Now we can insert these results back into Proposition 2.1, and we obtain 


ба ы К c [ (nio + (ni? + у) oe) at 
2 |w| Jo 


-i[ EO, Ji (gi af?) нос dt 
2 0 At 1 0 


3 Ї ОЕ е 
йо УТ + с (ei^ + sj) T?? +0 (тэ) ; 


Ш 
9 
о 


where 

om ГБ, 

ERU еви (5.1) 
І |w| 1 


is independent of т. Finally, using (3.8) together with (3.3), and Lemma 3.7, we get 


i = oy g(Fo)detg(Fo) ол. в(Ео) 
0 [ил | /27 det О Jin ` 


Proposition 5.1 The expansion of the call prices (at-the-money) in drift-less local 
volatility models is asymptotically equivalent, to first order, to 


c F l/ i 7 
Св(Ео, К, T) = m +5 (809807) 7°? + or 
T 


as T — О. 


In the final step, we compute an expansion of the implied volatility with respect 
to either Black-Scholes or Bachelier model. Let us consider the prices of call options 
with stock price Fo= Xai wi Fo,; = K inthe Black-Scholes and Bachelier models, 
assuming that the respective volatilities are of the form ogs = ogs,o + Tops, and 
C Bach = OBach.0 + 1 C Bach.1. We obtain the well known formulas 


Cgs(Fo, K, T) =Cps(K, K, T) = 
K 


cgsov T + 
J 2T 


ES Е : ORs ] + о(Т??), 
/ 27 24 x 
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Cog (F9, K, T) = Cass (E, K, T) = 
K K 
= O Bach,0'N T+ ован + acr, 
T 


Ут “т 


5.1 Zeroth Order Implied Volatility 


Despite being well-known, we recall the zeroth order implied volatility coefficients 
and some of their properties. By comparison of coefficients, see Proposition 5.1 and 
the above expansions for Cgs and Свасл, respectively, we find that 


1 . =i А 
CBS,0 = OBach,0 = —— 40 (Fo, Fo) (det Q) 2 =————, (5.2) 
[уел | К Fo 


where we also used Fo = К. Note, in particular, that the basket implied volatility 
(5.2) can be interpreted as a weighted mean of the individual components' (ATM) 


А : "A 2. Nn Foi i Fej j 
implied volatilities in the sense that (свѕ,0) = Èi j= pijWi ORS Wj -K CBS.0" 


Remark 5.2 The right hand side in Eq. (5.2) is nothing but the local volatility of the 
basket bom wi Е; at Fo in the Black-Scholes (i.e., log-normal) sense. Hence, we 
have obtained that the zero order term in the small time expansion of the implied 
volatility of the basket is equal to its local volatility when we consider an ATM option. 
That result is not surprising in light of [11], where similar results were obtained (in 
one-dimensional models). In this sense, one could even take (5.2) as an ex-post 
justification of Lemma 3.7. 


5.2 First Order Implied Volatility 


The first order implied volatilities in the Black Scholes and the Bachelier model do 
not coincide any more. Indeed, we immediately have the first order correction term 
in the Bachelier model 


A27 (i 7 
ogani = S (e ep )- (5.3) 


On the other hand, for the Black-Scholes model we have 


V2T ( dg , i ORs.0 ORs.0 
= —_ — = ———, 5.4 
OB$1 = =. б + 80 )+ 24 C Bach,| + 24 (5.4) 


implying that implied volatility quoted in the Black-Scholes framework is strictly 
larger than the implied volatility in the Bachelier framework up to first order—the 
prices are, of course, equal up to first order. 


Small-Time Asymptotics for the At-the-Money Implied Volatility ... 229 


6 Numerical Results 


6.1 The CEV Model 


As in [5], we consider the CEV model for the numerical examples. The CEV model 
is a special case of the general local volatility model considered so far, where the 
local volatilities are given by 


Bi В 
о:(Е) =EF;', і = 1,...,п, 


for some parameters €; > 0 апа fj; > 0. In fact, the most realistic scenario here is 
0 < 0j; x 1. Note that we allow 0; < 1/2, which implies degenerate densities of Е, 
at the boundary. 


6.2 Implementation of the Approximate Formulas 
and Simulation 


Implementation of the zero order terms of the implied volatilities in either Black- 
Scholes or Bachelier setting is, of course, easy using (5.2). On the other hand, the 
formulas for ops, т and ogach,1 are much less straightforward to implement. While the 
formulas in the ATM case are fully explicit (unlike in [5]) an efficient implementation 
is much less trivial. The formula for hı іп Lemma 3.3, for instance, depends on the 
derivatives up to order four of the squared Riemannian distance at Fo and on the 
Jacobi matrix of F +> ug(Fo, Е). Already the evaluation of the (n — 1) x (n — 1) x 
(п— 1) x (n— 1) tensor D*® can be very time-consuming, if a naive implementation is 
used, which does not take into account that most derivatives actually vanish. But even 
when more efficient implementations are used, the sheer size of the tensor may impose 
limitations on the dimension of the problem. So far, we have implemented (3.11) in 
Mathematica using symbolic differentiation of the squared Riemannian distance and 
the zeroth order heat kernel coefficient uo, which works for small dimensions, up to 
n — 5, say. 

As in the paper [5], we compare the approximate prices against prices obtained 
from sophisticated Monte Carlo simulation. Here, the CEV-SDE is discretized using 
the Ninomiya-Victoir scheme [18], which is a second order weak approximation 
scheme based on a splitting of the generator. Strictly speaking, the CEV process 
violates the strong regularity assumption of that scheme, especially at the boundary 
of the domain, but, as often in equity modelling, we do empirically observe second 
order convergence for CEV-baskets, yet another beneficial effect of "not feeling 
the boundary". For variance reduction, we combine the discretization with the mean 
value Monte Carlo method, see [19]. This is a variant of the control variate technique, 
where a linear combination of one-dimensional geometrical Brownian motions is 
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used as control variate. More precisely, we freeze each component but one of the 
basket, and replace the dynamics of the remaining basket by a corresponding Black- 
Scholes dynamics. In the resulting model, the true option price can be explicitly 
calculated. Finally, we choose a linear combination of those partially frozen model 
so as to minimize the variance of the Monte Carlo estimator. 

The expectation of the random variable obtained by combining the Ninomiya- 
Victoir discretization of the CEV process and the mean value Monte Carlo method 
is the approximated using Sobol numbers. In some sense, this contradicts the above 
motivation for the variance reduction, but we do find empirically that the integration 
error for a Quasi Monte Carlo estimator is also reduced by the variance reduction, i.e., 
the variance reduction also seems to reduce the number of most relevant dimensions 
of the integration problem. Finally, we sacrifice some of the accuracy available by the 
combination of the three techniques mentioned so far by introducing a random shift 
of the Sobol numbers, i.e., we use the Randomized Quasi Monte Carlo technique, 
see L’Ecuyer [15]. In this way, we can obtain reliable computable error bounds for 
the integration error. 


6.3 Numerical Example 


We consider a three-dimensional spread option, which is determined by the following 
parameters: 


8 0.4 0.7 -1 
Бо = | 17|, c2[08], 82[05], w=] 1 |, 
12 0.7 0.3 


with а correlation matrix 


1 0.9167390 0.7425194 
р = | 0.9167390 1 0.8099573 
0.7425194 0.8099573 1 


We compute the ATM price, i.e., the option price at K = 21, for maturities 
T € (0.5, 1, 2, 5, 10} years, which we compare with the zeroth and first order prices 
in the corresponding Bachelier model. We also report agacho = 0.1487036 and 
OBach,| = —6.72781 x 1075. Note that the “error bounds” reported in Tables | 
and 2 are upper estimates for the integration error (i.e., quasi Monte Carlo error) for 
the reference values. Hence, numbers obtained from the first order approximation 
formula are within the error bounds around the reference values. 

In Fig. 1, we plot (linear interpolations of) the relative errors of the zeroth and 
first order approximate pricing formulas close to the money (as obtained in [5]) and 
compare them to the ATM-formulas represented by circles. We see that the accuracy 
is extremely good in both cases, and that our approximation formulas for ATM CEV- 
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Table1 Prices 
Time Price Oth order price Ist order price Error bound 
0.5 0.88073 0.88092 0.88072 2.43e-05 
1 1.24525 1.24581 1.24524 4.6Зе-05 
2 1.76023 1.76184 1.76024 8.90e-05 
2 2.77895 2.78571 2.77941 3.21e-04 
10 3.91968 3.93959 3.92176 5.92e-04 


Error bounds given correspond to the (quasi) Monte Carlo error in the numerical scheme. The dis- 
cretization error is of higher order 


Table 2 Relative errors 


Time Oth order rel. error 15 order rel. error Error bound 
0.5 2.19e-04 6.85e-06 2.43e-05 
1 4.49e-04 3.80e-06 4.63e-05 
2 9.15e-04 9.02e-06 8.90e-05 
3 2.43e-03 1.65e-04 3.21e-04 
10 5.08e-03 5.33e-04 5.92e-04 


Error bounds given correspond to the (quasi) Monte Carlo error in the numerical scheme. The dis- 
cretization error is of higher order 


basket options nicely interpolate the formulas available away from the money. Indeed, 
deviations from the non-ATM values only appears at very small orders of magnitude 
in the logarithmic scale of Fig. 1 (where the Monte Carlo error contained in the 
reference values probably dominates). For the sake of completeness, Fig. 2 reports 
the absolute errors of the respective asymptotic formulas over a wide range of strike 
prices, indicating that the asymptotic formulas exhibit their worst quality ATM. 


Appendix A: Proof of Lemma 3.7 


We present a proof of Lemma 3.7. Recall that we want to compute the determinant 
of the Hessian Q of the map 


1 
Ф(6) = 5d (Fo. (G, Fy (G, K)))° 


evaluated at G = (Еол, теп Fo,n-1). Let G;(x) denote the anti-derivative of 1/0; 
satisfying (for simplicity) G;(Fo,;)=0. Now consider the change of variables Е — y 
with y; := Oj(F;j), i = 1,...,n. As verified in [5], this transformation turns the 


Riemannian geometry introduced above into an (almost) Euclidean geometry, with 


d(Fo, В)? = y ply. 
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1e-03 


Relative error 
5е-05  2e-04 


1e-05 


2e-06 


20.90 20.95 21.00 21.05 21.10 
K 


Fig. 1 Relative errors. Solid lines correspond to prices obtained from (non-ATM) zeroth order 
approximate formulas, dashed lines to (non-ATM) first order approximate formulas. The corre- 
sponding ATM-approximate prices are represented by circles and other symbols. Note that the 
option is ATM for K — 21 


Of course, the constraint on F translates into a constraint on y, which can be removed 


by eliminating one variable. Indeed, setting x :— (y1, ..., Yn—1), we get 
1 п—1 
Yn(X) = Gy (Fn) = 6, "s K — > w570) 
n 


This way, we understand Ф (G) as a function (x) in the new (reduced) coordinates, 
and obtain for the Hessian 


Hed (G) = J(G)! Axy(x) (С), 


where Hg апа Н; denote the Hessians in the G- and x-coordinates, respectively, 
and J(G) denotes the Jacobian matrix of the change of coordinates С — x. As 
e = 1/o;, we have J(G) = diag(1/o1(F1), ..., 1/on—1(Fn—1)). Regarding the 
matrix Ну, an elementary calculation using the fact that F = Fo corresponds to 
y = 0, we obtain 
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Absolute error 


1e-05 


1e-07 
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Fig. 2 Absolute errors. Solid lines correspond to prices obtained from (non-ATM) zeroth order 
approximate formulas, dashed lines to (non-ATM) first order approximate formulas. The corre- 
sponding ATM-approximate prices are represented by circles and other symbols. Note that the 
option is ATM for K — 21 


in Wj0 j (Fo, j) in Wigi(Foi) , prawa)" 


Hyo(0) = С 
T WpnOn (Роп) WnOn(Fo,n) | Won (Fon)? 


i,j=l 


From the structure of the above expression and the expression in Lemma 3.7, we see 
that we may assume that у; = 1,i = 1,..., п, and on (Роп) = 1. In this case, we 
are left to prove that the determinant of the matrix 


n-l 


A = (pl — p's; — psi + рв) jy 


is equal to the expression a :— 87 ps/ det p, where we used the short-hand notation 
s; = o;(Fo;),i = 1,...,n — 1, and s, = 1, ands = (sj,..., Sn). 

As both det A and a are polynomials in s1, ..., 5,1, we prove this equality by 
establishing that they have the same coefficients. Here, Cramer's rule is the essential 
tool: 


1 
B! = — Adj(B), 
det B |5) 
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where the adjugate matrix Adj B is the transpose of the matrix of co-factors, i.e., 
; = +] m 
(Adj B);; = (—1)'! det Вз, 


with В being obtained from В by removing the jth row and the ith column. By 
symmetry, we hence have 
Pij 
det p 


= (—1)+Ј det рг, v(i, j) € {1,... n — 1%, (A.1) 


where pre is understood in the sense of (p s. 

Let us also establish a few notations. Let S,.., be the set of all permutations of 
(1, ..., n — 1} and let, similarly, S(A; B) denote the set of all bijective maps from 
A C Nto B CN, with A, B having the same (finite) size. Moreover, the definition of 
the signature sign is extended to S(A; B) in the obvious way (as being -Е1 depending 
on the number of inversions being even or odd). Moreover, for a monomial x in the 
variables 51, ..., 5—1 we denote by ту p the coefficient of any polynomial p w.r.t. 
the monomial x. In order to establish Lemma 3.7, we need to prove that 


2(п—1) 
Vx є U {s1, eg ae : Ty det A = пуа. 
k=0 


We distinguish different cases according to the degree. 


Case 0. For deg x = 0, i.e., x = 1, we have 


Pnn 
det p 


п—1 
madetA= У, sign(o) | [ /"? = det o; = Афр) = 


OESn-1 i=l 


= та. 


Case 1. For some fixed s; we have 


Tsy det A = > sign(c)(—1) pf Ф" П gre) 


RE іє{1....„п= 0—10) 
d oem П pO 
ie(L,..n-1ME] 


= —2 >, sign(c) p^ ?" П poo 


05-1 ie(1,...,n—1)MK] 
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by symmetry of p~!. There is a one-to-one correspondence between $,., and 
$({1,...,п}\\{К};{1,...,п — 1р given by e ос defined by 


on. | i € {1,...,п = 1\6), 
o(i) = І 
o(k), і = п. 


Moreover, опе can see that sign(o) = (2D sign(c). Hence, we obtain 


Ts, det A = —2 >, sign(c) p? П pic) 
O€Sn-1 ie{1,....n—1}\{k} 
= 2(-1)*" ` sign)" р 0 
бєб$({1,...,п}\{К};{1,...л—1}) ie(1,...,n—1) MK] 
= 2(—1)**" det pi. 


AE 2 kn 
—2Adj(p )m- dus = туа. 


Case 2. We consider x = s;s;. For simplicity, we assume k = l (К 4 І works 
analogously). We have 


T2 det A — >, sign(ca) | 1к=с(ур"" П pio) 4 
и ie(1, n 1A] 


=f Н г 
4 krzew p "0° (k)n П pee 
ie(1,...,n—1)MK,o-1 (K)) 


We construct a bijective map from 5,1 to S((1,..., n} \ {k}; {1,..., a} \ {k} by 
mapping с € 5,1 to o defined by 


n, i —n, 


^ | i € (L,...,n = 1) AME) 
o(i) = 


for the case k = o(k) and 


a(i), ie{l,...,n — 1}\ {k, o7! o 
a(i) = 1n, i-o (Kk), 


o(k), i-n, 


else. Note that it is easy to see that sign(c) = sign(c). Hence, we have 
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ту det A = >) sigo [| 0 


oESy—| ie{1,...,n}\{k} 


E >, 510п(5) П p? 


бєб({1,....п}\{К};{1,...,п}\{К}) ie{1,...,n}\{k} 


-1 
= detp.. = T24. 
Peg зр 


Higher order terms. Regarding the higher order terms, we note that mya = 0 for any 
monomial of degree larger than two. Therefore, the same should be true for det A, 
where it does not to seem to follow from an obvious argument. Note that we only 
need to consider polynomials where each individual variable są appears at most two 
times, as any other monomial cannot appear in det A by the definition of A and of 
the determinant. But any coefficient of det A with respect to such monomials can 
be understood as the determinant of a matrix p~!, which is obtained from p^! by 
omitting one row and one column апа by replacing some rows/columns by copies of 
other rows/columns. Of course, any such matrix р has vanishing determinant, imply- 
ing that 7, det A = 0. For concreteness, we indicate this mechanism by appealing 
to two special cases. First, take x — 525], 1 Æ k. Similarly to the case of х = sg, one 
can show that 


Ts det A = —2 X”. sign(c) [t-00 tm П pio 
ie{1,...,n—1}\{k,1} 


—1 —1 жыл бу 
+ 1,2505 07 ©" o? Wn g7 (Dn П pu 
ie(1,...,n—1]Mk,o-! (k)o-! D} 


oESp-1 


which is (the multiple of) the determinant of p, which is obtained from Pis by 


replacing the /th row by the last row. As the last row appears twice in p—!, the 
determinant, and hence T 25 det A, vanishes. 
The mechanism is even more transparent for the most extreme monomial x = 


2 In this case, 


2 
Sy an ee 


752.52 | det A = >, sign(c) (p "^! = 0, 


oESy-1 


as the determinant of the (n — 1) x (n — 1) matrix with all entries being equal to o". 
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A Remark on Gatheral's *Most-Likely Path 
Approximation" of Implied Volatility 


Martin Keller-Ressel and Josef Teichmann 


Abstract We give a new proof of the representation of implied volatility as a 
time-average of weighted expectations of local or stochastic volatility. With this 
proof we clarify the question of existence of ‘forward implied variance’ in the orig- 
inal derivation of Gatheral, who introduced this representation in his book ‘The 
Volatility Surface’. 


Keywords Implied volatility - Local volatility © Most-likely path 


1 Gatheral’s Most-Likely Path Approximation 


In his book “The Volatility Surface—A Practitioners Guide’, Jim Gatheral presents 
an approximation formula for the implied volatility of a European option, when the 
underlying stock follows a general diffusion process 


dS; 


t 


The *most-likely path approximation’ to implied Black-Scholes volatility in this 
model consists of two parts: The first part is the assertion that implied variance—the 
square of implied volatility—can be written as a time-average of weighted expecta- 
tions of a? (t, S;): 
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2 1 (^ae [2 
Timp(K, T) = 7], Ў le (t, s)| dt . (2) 


Here, the measures G, are given by their Radon-Nikodym derivatives with respect 
to the risk-neutral measure Q, 


dG; = 52,Гвѕ(5,, ©к,т (0) (з) 
dQ Е[52,Гвѕ(5;, FK, r] 


where Ox r (t) is a function that is yet to be specified, Гвѕ denotes the Black-Scholes 
Gamma and expectations are always taken to be under the risk-neutral pricing mea- 
sure. Let us emphasize that (2) is an exact formula, and that it is the second part of 
the method where the approximation happens: Gatheral argues that the density (3) 
is concentrated (as a function of (t, $)) close to a narrow ridge connecting today's 
stock price So to the strike price K at time Т, and claims that a good approximation 
to (2) is to evaluate it as if the density was entirely concentrated on this ridge.! In the 
terminology of Gatheral this ridge is called the most-likely path and the described 
approximation method the most-likely path approximation. Extensions of the repre- 
sentation (3) have been proposed e.g. by Guyon and Henry-Labordére [2] for implied 
correlations. 

In this note we will only be concerned with the first part of Gatheral's method, 
i.e. the derivation of the exact Eq. (2), and in particular the definition of the yet 
unknown function ск,т (f). Gatheral [1] defines on p. 27 first the “Black-Scholes 
forward implied variance’ окт (t) by 


Е [c?(t, 5,)52,Гв5(5;, oK.T(t))| 


ux T(t) = : A (4) 
= [S? as (Sr, 9 x,r(0)] 
and then, in the equation below, the quantity c x, r (t) by 
2 1 : 
10 = р | "ronde. 65) 
i T = t t 


Differentiating (5) and inserting into (4) yields an ordinary differential equation for 
ск.т(1). This definition through an ODE leaves open the question whether (and 
under which conditions) the quantities ок т (f) and ск, т (t) actually exist.? We will 
show that a simpler definition of 7x ,7 (t) can be given, which clarifies the problem of 
existence, implies Eqs. (4) and (5) and finally leads to a proof of the implied volatility 
representation (2). 


'See Gatheral [1, p. 29ff] for details. 


?See also Lee [3, Sect. 2.3], who remarks that the proof in Gatheral [1] hinges upon the assumption 
of the existence of ок т (t). 


A Remark on Gatheral’s *Most-Likely Path Approximation’ of Implied Volatility 241 


2 A New Proof of the Implied Volatility Representation 


For our proof of the implied volatility representation we assume that the stock price 
follows an Itó-process with respect to the risk-neutral measure Q (with respect to 
which all expectations are taken) of the form 


dS; 
еи (6) 


1 


such that the discounted stock price (e "5,)o-,-7 is a square-integrable martin- 
gale. The volatility process c is a general predictable, W-integrable process. This 
setup covers in particular local volatility models, where с; = o (t, S+) and stochastic 
volatility models where с; = a(t, V;) and У, is a stochastic factor driving the volatil- 
ity. We fix a terminal time T' and assume that S is non-deterministic in the sense that 
P(S; Æ Sr) > 0 for all t € [0, T]. Fixing also a strike price К we are ultimately 
interested in the implied Black-Scholes volatility cimp(T, К) for a European option 
with expiry Т and strike К in the above model. 


2.1 A Regime-Switching Model and Implied Forward Total 
Variance 


To start our derivation, we associate for each и € [0, Т] and X" > 0 the ‘regime- 
switching’ process S" to S, given by 


a! =rdt+o,dW, гє [0,u] 
x (7) 
At =rdt+E"dW; t e[u, T]. 


The process S" switches, at time f = и, from the dynamics (6) to Black-Scholes 
dynamics with constant volatility X". It should be obvious, that ST — S, while 
S? is simply a Black-Scholes model with volatility X9. In what follows, it will be 
helpful to consider the total variance wą = (T — u)(X")? instead of X". By simple 
conditioning, the price of a put option on S“ with strike К and maturity T is given by 


à "Tg [CK — Sc] Lg ie r(T—u) z [(K — Smr] 
=e d ù [Pgs (u, Su, T; K; wy)]. 


where Pps(u, S, T, К; ш) is the Black-Scholes put-price parametrized by total vari- 
ance, i.e. 
Рвѕ(и, S, T, К; ш) = e" ") K(—d5) — S®(—d)) 


242 M. Keller-Ressel and J. Teichmann 


and 


Definition 2.1 Боги є [0, T) we define the implied forward total variance ©, = 
№. (Т, К) > 0 as the solution of 


e ""E [Рвѕ(и, Su, T, К; Ôu)] = е "E[(K — Sr) +] (8) 


i.e. 10, is the total variance ш, = (T — u)(X")? that has to be chosen in the regime- 
switching model (7) such that the resulting put-price coincides with the put-price 
from the original model (6). 


Proposition 2.2 There exists a unique positive deterministic function и œ> Wy, such 
that the equality 


e "E [Pss(u, Su, T, K; à,)| = е "E[(K — Sr)+] (9) 


is satisfied for all u € [0, Т]. 


Proof For ш = 0, the Black-Scholes price e~™ Рв (и, Su, K, T; ш) equals е" 
(e 179 K — $,),. Since (e ""S,)o-,-r is a martingale, we have by Jensen’s 
inequality that 


e^"'E [Pas(u, Sy, K, T; 0)] = e™™E [7 7? K — $94] < e""E[( — 5т)+]. 


For ш — оо the Black-Scholes price Рв$(и, Su, K, T; ш) approaches e^" 7? К. 
In this case we get 


e "E[Pss(u, Su, T, К; 00)] =e" K > e '"E[(K — Sr)+]. 


In addition w + Pps(t, $+, T, K; ш) is for any given 5; a continuous and strictly 
monotone increasing function (here we need the non-degeneracy assumption on 5), 
hence also ш +> E[Pgs(t, S;, T, К; w)] is. Therefore we conclude that (9) has a 
unique solution w, for each u є [0, Т]. 


Remark 2.3 Notice that the previous proof holds in fact for semi-martingales S, such 
that (ехр(—г?)$)р<т<т is a martingale, so neither square integrability nor absence 
of jumps are needed. However, we do not get regularity assertions for и œ> Wy. 
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2.2 Main Result 


We now present our main result on the implied forward total variance w,. Here 
the assumption of continuous trajectories is really needed, as well as the following 
L?-continuity assumption: 


Assumption 2.4 We assume that о, is mean-square continuous, i.e. the map 
[D T] > u |> gi € 12(9, Q) is continuous with respect to the L?-topology. 


Theorem 2.5 Under Assumption 2.4 the mapping и +> Wy is in C'[0, T)NC°[0, T] 
and satisfies the ODE 


aw, & [ O(d2(tiy))o2 | 
= 0, T), 10 
pa pecu" 65 Е 


with terminal condition lim, , 7 Фт = 0 and where ф denotes the standard normal 
density. For и = О it holds that 


ûo(T, K) = Toi, (T, К), 


їтр 


where сітр(Т, К) is the implied Black-Scholes volatility for time-to-maturity T and 
strike K in (6). 


Remark 2.6 Equation (10) can be rewritten as (2). Alternatively, it can be written as 


| OU, т |2 (d2(Wy)) 2 
ou jp] ee peaa) 


i.e., the rate of decrease in total implied variance is given by expected instanta- 
neous stochastic volatility plus a correction term that accounts for correlation effects 
between о, апа S, in a highly non-linear way. 


Proof We set 


F(u, ш) =e ""E[Pas(u, Su, T, К; w)]. 


Note that the derivative of Pgs with respect to total variance ш is given by 


д 1 
р T, K; w) = —=S¢(d 
ou в$(и, S, T, К; w) za 1), 


which, inserting S = S,,, is uniformly integrable in w on each interval (є, оо), є > 0. 
Hence for w € (0, oo), 


fa) —ги | —rT 
55 96 ш) = 3 ?[S,0(d1(w))] = 37m 


? [b(d2(w))]. (11) 
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Applying Ito's formula and using the martingale property of S we obtain 


2 ded uen d 
— — Pgsr 
Du PST gg Be +355 


О FQ ш) =е ru | + САР | (12) 
и 


Parameterized by total implied variance, the Black-Scholes put-price Pgs satisfies 


д д 
—rPps + — Рвѕ + rS — Рвѕ = 0, 


ди OSs 
such that (12) simplifies to 
д "OE o? 2 c? Dg 2 
a, Е(и, ш) = et | т ag; Pes Sho J= 2 yg Eewa) 013) 


Note that due to Assumption 2.4 both д, F (u, ш) and Oy F (u, ш) are continuous. 
Furthermore, recall that Ù, is given in Definition 2.1 by the implicit equation 


F(u, ty) =e" E[(K — $7)4]. (14) 


where the right hand side depends neither оп и nor on 0. Let us first examine the 
boundary behavior of F (u, ш). We easily derive that 


lim F(u, w) = (е 'TK —e-"S,) | 
w>0 + 
jim F(u, ш) = oor, 


lim F(u, ш) = Ppgs(0, So, К; ш), 


u—0 


lim F(u, ш) = eT E[®(—do(w)) K — ®(—d(w))S7]. 


uT 


By Jensen's inequality and the assumptions on the non-degeneracy of S it holds that 


[ek — e7" Su), | < e" "E[ — $4] < 77K 


for all и € [0, T). From (11) we see that 0, F (u, w) > 0 and hence w +> F(u, ш) 
is increasing for ш є (0, оо). Altogether, it follows that for each и є [0, T] a 
unique 1», solving (14) exists. In addition, by the implicit function theorem, 10, is 
in C'[0, T) п C°[0, T] with derivative 


д ou F(u, ш) = & [ Ф(4› (w4)0] 


Ou ^ — p Fu, w)  E[o(da(w,))] 
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where we have combined (11) and (13). The initial and terminal conditions for 
Wy, atu = 0 and u = T can be derived from the above boundary conditions for 
F(u, w). Indeed, 

Pgs (0, So, К; Wo) = C(K, T) 
implies that wo = Tof. , Where gimp is the Black-Scholes implied volatility corre- 
sponding to the put-price P(K, Т). Finally 


С [P(—d2(w))K — e(-di(w))Sr] = P(K, T) = E[(K — Sr)+] 


implies that w = 0 and hence both boundary conditions for w, follow. 
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Implied Volatility from Local Volatility: 
A Path Integral Approach 


Tai-Ho Wang and Jim Gatheral 


Abstract Assuming local volatility, we derive an exact Brownian bridge 
representation for the transition density; an exact expression for the transition den- 
sity in terms of a path integral then follows. By Taylor-expanding around a certain 
path, we obtain a generalization of the heat kernel expansion of the density which 
coincides with the classical one in the time-homogeneous case, but is more accurate 
and natural in the time inhomogeneous case. As a further application of our path 
integral representation, we obtain an improved most-likely-path approximation for 
implied volatility in terms of local volatility. 


Keywords Small time asymptotic expansion * Heat kernels expansion - Implied 
volatility * Local volatility model * Most likely path - Path integral 


1 Introduction 


Because of their consistency with the known prices of European options, and despite 
their unrealistic dynamical implications, local volatility models continue to be used 
in practice as powerful tools for risk management of equity derivatives portfolios. 
Under the forward measure (with no drift), local volatility models take the form 


dS, 
qoc ebd an (1.1) 
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where В, is a Brownian motion and oc; is a local volatility function that depends only 
on the underlying level S and the time f. 

Assume that prices of European options of all strikes К and expirations T are 
given or equivalently that the Black-Scholes implied volatility function ogs(K, T) 
is known. In that case, it is straightforward to compute the local volatility function 
c, from, for example, Eq. (1.10) of Gatheral [6]: 


КЫ л = vM 
(1-5 2) -1 (+t) (4) + 22 


where k denotes the log-strike k := log К/5 апа w, ће Black-Scholes implied total 
variance, given by w(K, T) := as (K,T) T. 

In practice, we observe option prices for only a finite set of strikes and expirations. 
Moreover (see for example Gatheral and Jacquier [7]), itis very hard if not impossible 
to find a functional form for implied volatility that both matches observed prices and 
is free from static arbitrage. One alternative approach is to assume a parameterized 
functional form for the local volatility function c;(S, t) and price a finite set of 
European options, tuning the parameters of the function until a satisfactory fit is 
achieved. Such calibration of local volatility models to given option prices is in 
practice typically performed using numerical PDE techniques. However, numerical 
PDE techniques are slow and moreover are not practical in higher dimensions. 

Alternatively, to achieve better understanding of the qualitative properties of local 
volatility models, and potentially faster calibration, both academics and practitioners 
have exploited asymptotic expansions of implied volatility in terms of local volatility. 
First, Berestycki et al. [2] solved the nonlinear PDE (1.2) for the implied total vari- 
ance v in the small time to expiration limit, obtaining an exact expression for implied 
volatility as an integral of local volatility. Subsequently, this asymptotic approxima- 
tion was extended, to first order in time to expiry т = T — t by Henry-Labordére (see 
the article in this volume and also Henry-Labordére [12]), and then to second order in 
Gatheral et al. [9] using the heat kernel expansion. Jordan and Tier [14] apply similar 
methods to derive an asymptotic solution for the SABR and CEV models. In related 
work, the paper of Cheng et al. [5] derives an operator expansion of the density, 
which up to first order agrees with prior expansions obtained using the heat kernel 
expansion. As an earlier example of work in a similar spirit to the most-likely-path 
approach of our paper, Baldi and Caramellino [1] develop a small-time expansion 
for the hitting probability of a one-dimensional diffusion. 

Our contribution in this paper is to derive an exact Brownian bridge representation 
for the transition density, from which an exact expression for the transition density 
in terms of a path integral follows. Indeed, the path integral representation of the 
density has often been used as a powerful tool for the derivation of improved asymp- 
totic expansions of the transition density. For example, in the foregoing, we apply 
a technique from the paper of Goovaerts et al. [10]. An earlier paper by Linetsky 
[16] provides a more general survey of the application of path integral techniques to 
option pricing. 


e? (K, T) = 
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By replacing all paths that contribute to the path integral by the most-likely-path, 
the unique path that minimizes the action functional in the path integral formulation, 
we obtain a new approximation to the transition density which is both more accurate 
and natural than the classical heat kernel version. As an application, we obtain an 
improved most-likely-path approximation for implied volatility in terms of local 
volatility. 

The most-likely-path (MLP) approach has been used to analyze the asymptotic 
behavior of implied volatility in stochastic volatility models in Gatheral [6]; this 
analysis is further elaborated in an article by Keller-Ressel and Teichmann [15] in 
these proceedings. Guyon and Henry-Labordére [11] and Reghai [17] both explore 
alternative definitions of the most likely path, achieving improved accuracy by con- 
sidering fluctuations around the MLP. In particular, Guyon and Henry-Labordére 
[11] compare and contrast various approximations in a unified setting. Though the 
approach of Guyon and Henry-Labordére [11] differs from our path integral approach 
in the current paper, it is worth mentioning that their heat kernel approximation is 
closely related to ours. Once again however, our path integral approach leads to an 
unambiguously natural definition of the most-likely-path. 

Our paper is organized as follows. In Sect. 2, we derive Brownian bridge and path 
integral representations for the transition density of one dimensional diffusions. As 
an application, in Sect. 3, we present a novel probabilistic derivation of the heat kernel 
expansion, also referred to as the WKB method in the physics literature. For time 
homogeneous diffusions, this new expansion recovers the conventional heat kernel 
expansion; however, in the time-inhomogeneous case, the two expansions differ a 
little. In Sect.4, we present heuristic derivations of known small time asymptotic 
expansion of implied volatility to zeroth order. From the path integral perspective, 
these known approximations are suboptimal in the sense that they correspond to 
computing the optimal path of an approximate but incomplete action functional. By 
considering the optimal path of the exact action functional, we show how an optimal 
approximation may be computed. An interesting feature of the optimal approximation 
is that it recovers the implied volatility of the time dependent Black-Scholes model 
exactly, which so far, to the best of our knowledge, none of the existing small time 
approximations are able to achieve. Finally, in Sect.5, we summarize and conclude. 

Throughout the text, B; denotes the standard Brownian motion defined on the 
filtered probability space (Q, F;, P) satisfying the usual conditions. X; denotes the 
Brownian motion with some drift h. p* (T, y|t, x) denotes the transition density 
of X from x at time t to y at time Т and similarly p?(T, sz |t, s+) is the transition 
density from s; to sr of the process S;. Moreover, dot will always refer to the partial 
derivative with respect to the time variable and prime to the space variables x or s. 
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2 Path Integral Representations for Transition Density 


In this section, we derive path integral representations of the transition density and 
of the call prices under local volatility, which will in turn yield the most-likely-path 
approximation to implied volatility. The key ingredient in this derivation is a Brown- 
ian bridge representation for the transition density, which though straightforward, 
does not appear to be well-known. 

We start with the case of one-dimensional Brownian motion with general but 
Markovian drift. We reduce the more general diffusion case which concerns us here 
to this one by applying the well-known Lamperti change of variable. 


2.1 Brownian Bridge Representations 


Two Brownian bridge representations for the transition density of Brownian motion 
with general but Markovian, smooth and bounded, drift are derived in Theorem 1. The 
first expression, (2.1), will be used in the derivation of the path integral representation 
for transition density in Sect. 2.2 and the second, (2.2), will be used to derive the heat 
kernel expansion of transition density in Sect. 3. 


Theorem 1 Let X; be a Brownian motion with drift driven by 
dX; = d B, + h(X;, t)dt, 
where the drift h is assumed smooth and bounded. Let H be an antiderivative of h 


with respect to x, i.e., 2H, t) = h(x,t), for all x and t. The transition density 
рх of X, has the following two equivalent Brownian bridge representations: 


ET T T 
p* (T, уй, x) = ФТ — 1, y 2 x) Ey [e h(X,,s)dX,—3 f; “© (2.1) 


and 
p*(T, ylt, x) = (T — t, y — x) e# 0 D-H 00) x 
б, [e LIT ac RAE Q.2) 
2 ~ 
where ф is the Gaussian density $(t, €) = NL e 2. The notation E, ,|-] denotes 


the expectation under the Brownian bridge measure from x to y. 


Proof Note that X, under the original measure P is a Brownian motion with drift Л. 
Define a new probability measure P through the Radon-Nikydom derivative 


D 
d — e- SE h(Xs,8)d B,—1 [7 h?(Xs,s)ds 
dP ` 
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By the Girsanov theorem, X; is a Brownian motion under P. Given any bounded 
measurable function f, we have, since dB; = dX; — h(X;, t)dt, 


where, for notational simplicity, 


Zi x[:] denotes the conditional expectation E[:| X; 


e dP E T 1 pT 
aL Ё [SAD] = fine [Fer Wl nmn Po), 


x], and similarly for IE; , [-]. It follows that, for any bounded measurable function f, 


T" Т _1 pT 2 
[fora г. дау = | rw з К h(X,,s)dX,—À fT h d 


i 


Ф(Т —t,y — х)ау, 


where E, yL] = E[.|X, = x, Xr = y]. Consequently, (2.1) follows, i.e., 


p* (T, ylt, x) = Ф(Т — t, 


Furthermore, Ito's formula impli 


T 
i h(X;, s)d X, = H(Xr, T) 
t 


y — х)Ё, А | Мы A iud | 


es that 


T 
ноход | [mos 
t 2 


where we recall that Н is an antiderivative of Л with respect to x. Thus, 


efi W(Xs,s)dXs _ „М 


We further rewrite the transition 


Qc T)- H ОХ) Ј [H css) 9 Jas 


density as 


p*(T, ylt, es = t, y — x)e” ten 


ig 


Е [e кра ылыан, 


This completes the proof of (2.2). 


hy (Xs, | ds, 


Remark 1 We remark that the conditional expectations in both (2.1) and (2.2) are 
under the Brownian bridge measure since X; is a Brownian motion under P. One 
intriguing feature of the representation (2.2) is that, if we Taylor expand the condi- 
tional expectation for small T — t around the straight line connecting the initial and 
terminal points, we recover the heat kernel expansion in the time-homogeneous case 
and probably do better than the heat kernel expansion in the time-inhomogeneous 
case. See Sect. 3 for more detailed discussions on the heat kernel expansion. 
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Now for the general diffusion case, consider the process $; driven by the stochastic 
differential equation (SDE) 


45, = (8; , t)dt F a(S;, Dd B;, So = 50, 


where for simplicity, we assume the coefficients у and a are Lipschitz and of linear 
growth; a is further assumed strictly away from zero. By applying the Lamperti 
transformation x — JS is ЕБ, the process S; is transformed into a Brownian motion 
with drift. Specifically, denote the transformation from s to x by x = y(s,t) = 
[s ET Applying Ito's formula to X; = (S, t) yields 


d X, = 40(5;, t) 


: a^(S,,t 
= |i. t) + nC tos t) + ent) A n| dt + ps(Sr, t)a(S;, t)d B, 
. I; t) as (S;, t) 
= t B 
Pto 2 и 


= dB, + h(X,, t)dt, 


where subindices of ọ and a refer to partial derivatives. The function й is defined as 
h(x,t) = Qs, t) + AES — абы), with s = y~!(x, t). The transition densities p? 
for 5; and p* for X; are then related as 


5 _ 1 X 
Dp? (GL, sr|t, s) = р QG,xrlt, xi), 
а(ѕт, T) 


with xr = (sr, T) and x, = q(s;, t). Thus, the transition from the Brownian 
bridge representation for p* to a similar representation for p? is straightforward by 
applying Theorem 1. Theorem 2 formalizes this result. 


Theorem 2 Let S; be the diffusion process driven by the stochastic differential equa- 
tion 


dS, = (Si, t)dt + a(S;,t)dB,, So = so. 


Denote the Lamperti transformation from s to x by x = (s, t) = f Н EE Define 
the function h by h(x,t) = Ф(5,1) + AR — “60. with s = p(x, t), where 
subindices refer to corresponding partial derivatives. Let H be an antiderivative 


of h with respect to x, namely, ZH (x, 1) = h(x,t), for all x and t. Then the 
transition density p’ of S, from (t, s;) to (T, sr) has the following Brownian bridge 
representations: 


S OT — t, фФ(5т, T) — p(s, t)) а 
p “(T ѕт |і, s) = mq Eos. ter. T) 


x [е7 sei Mid (2.3) 
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and 
T — t, (sr, T) — p(s, t Й = 
p(T. spit, A PCT, T) — eG D). nor T) T) -HGGu0 „ 
a(sr, T) 
i5 —4 f I2 Xs s) hy (Ху) -2 Hr (X, 5)ds 24 
Lo(st)o(sr,T)|€ 7°! ‚ (24) 
; ; | i.e 
where again denote the Gaussian density Ф(1,&) = e 2. As before, the 


A 2nt 


notation E. yl] denotes the expectation under the Brownian bridge measure from x 
to y. 


Note that the X, process in both expressions (2.3) and (2.4) is a Brownian bridge 
from хт = q(sr, T) to x, = (si, t). One application of such Brownian bridge rep- 
resentations of transition densities is to devise more efficient simulation schemes. For 
example, for some given function f, we may compute numerically the expectation 


čt s Lf (ST)] in x-space as 


2s LA GI = Elf (o Xr] = / jou Tn раа 


- | foq Gr. T) OF =i Ae 


D le 23 ИНО 0и] 


x Esas хт 


4x7,Y 


Е [оф 1 cy, Tje O D-H) Ë [ pO STUNDE 


where Y is a normal random variable with mean x; and variance T — t. Therefore, if 
there is an efficient method to calculate or approximate the conditional expectation 


~ її? А . 
Suy le [^ киыны ныи! їп the Brownian bridge measure, the 


expectation E, , [f (Sr)] could potentially be computed more efficiently. Since X; 
is a Brownian bridge from x, to Y, one obvious approximation is to simply replace 


І . ~ І (Ta : 
the integral in the exponent of E,, у [е Zh ^ po) PEDE HORDE] with the 


integrand evaluated along the straight line x, = тт + TY for r € [t, T]. In 
other words, 


o le rhe ИИ 


ду е AT Hert) thy Отот) 2H Gu rd Q.5) 
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Hence, 


г, s [LEGDI Z e 3 Ј Eos mh Gs HH Gur) dr 
ө 


: [7 о (ү, Т) ds] 


after which we need only simulate the normal random variable Y. 

The straight-line approximation in (2.5) seems somewhat ad hoc. Would we do 
better with another path? Why not add two extra paths to take some account of the 
variability of the random paths in the full integral? More generally, is there an optimal 
or systematic way of picking these paths? The path integral representation in Sect. 2.2 
may provide a partial answer to this question. The Brownian bridge representations 
(2.1) and (2.3) play a key role in the derivation of this path integral representation. 


2.2 The Path Integral Representation of the Density 


In this section, we provide a formal derivation of the path integral representa- 
tion exploiting the Brownian bridge representations of Sect. 2.1 and the Chapman- 
Kolmogorov equation. As in Sect.2.1, we will use the notations y(s;, f) and x; 
interchangeably. 

Let (f = tg < ti < --- < tn = T) be a partition of the time interval [r, Т] 
with Af; = ti — ti—1 = Т, fori = 1,...,n. By iteratively applying the Chapman- 
Kolmogorov equation, the transition density рэ (Т, sr |t, s) can be written as 


n 
pe, sp|t, Sst) = f Я [Пг Silti-1, Si-1) ds1 ...dsa—1, (2.6) 


i=l 


where we set so = s; and s, = sr. Recall from (2.3) that the transition density p 
of S, from (ti—1, sj 1) to (fj, s;) has the Brownian bridge representation 


pP ti, Silti-1, sii) 


_ PAT, plsi, ti) — Y(Si-1, fi-1)) x 
== alsi, ti) "pGi-1.li-1).eGi.sti) 
I5 *l 


jet ht dX. f | аи 


, 


where X; is a Brownian bridge from ọ(si—1, ti—i) to v(si, ti). We next compute the 
limit of (2.6), as At > 0* (or equivalently n — oo), assuming that, fori = 1,...,n, 
the s;'s form a discretization of a differentiable curve s+, for 7 € [t, Т]. 
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We have 
a fi WX dX} fj WX, ndr 
lim No(s: f ot) | ei , TTZ J| , 
"(si 1. ti-1).v isti) 
п оо 
i=l 
= оЈ бт), т) f h? (ұ(вт,т),т)ат 
апа 
п 2 
lim e zii lei) Gi-isti-1)] 
n—oo 


i-l 


2 
— lim oom YaleGit)-eGi-i.ti-0]* 
noo 


-Axt [oa Serna] А+О(лз?+л)? 


= lim e 
п оо 


= ei h [eon eoe ar, 


Substitution into (2.6) and taking the limit n — оо yields the following path integral 
representation for the transition density p? 


1 T. / E ae hits 2 
p(T, srt, s) x CEM enDi- ATDS, (27) 
@ 


where 
ds; 


lim 
п 00 ./27At a(sr, T); П М2т At a(si, ti) 


and C, denotes the collection p all differentiable curves from (t, s;) to (T, sr). 
Equivalently, because dx; — TOY we may rewrite the path integral representation 
(2.7) more neatly and simply in x-space as 


1 Т. 
р°(Т, эт |2, s) = xm gi t rhan D dr pp] (2.8) 


where 
п—1 d 
D[x] 2 lim : 
noo Ag At H A/ 2m At 


and C, denotes the collection of all differentiable curves from (t, x;) to (T, хт). 
We shall henceforth deal mostly with the simpler expression (2.8). Heuristically, one 
could think of the path integral representation (2.8) of the density as an exponentially- 
weighted average over all possible differentiable curves connecting x; to xr. D[x] 
could then be regarded as the “Lebesgue” measure on the space of differentiable 
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curves connecting x; to хт, though mathematically such a measure does not really 
exist. 

Assume now that under the pricing measure (assuming zero interest rate and 
dividend yield), the price S; of the underlying is driven by the SDE of local volatility 
type 

dS, = a(S;, t)d By. 


The path integral representation (2.7) of the transition density p? in this case has the 
following simpler form 


2 
l Т: ST | as(sr,T) d 
pT, srs) f oN pitt EP] npa 


C; 


Integrating the payoff function over the transition density, the path integral represen- 
tation for call price is immediate: 


ir 


оо 1 fT as (ѕт,т) 2 
C(t, st, K,T) E (sr — ю | eth Е м: | ^ Dis]dsr, 
K Cs 


or equivalently in x-space, 


99 —K T 
CU з, K,T) =) ži e ЇЙ rhen DPD dsr, (2.9) 


K alst, Т) Je, 


— sC st) 
where h(x, t) = p(s, t) — 557. 


3 Probabilistic Derivation of the Heat Kernel Expansion 


The heat kernel expansion is a small time asymptotic expansion of the fundamental 
solution of the heat equation over a Riemannian manifold. Reexpressing the transition 
density of a diffusion process in terms of this fundamental solution leads naturally to a 
small time asymptotic expansion of the transition density. This topic is well-studied 
in the Riemannian geometry literature, see Chavel [4] for a geometric analytical 
approach and Hsu [13] for a probabilistic approach. In the physics literature, the heat 
kernel approach to deriving small time asymptotic expansions is also known as the 
WKB method or the ray solution, see Jordan and Tier [14]. Deriving such expansions 
in one dimension is much simpler than in higher dimensions where no analogue of 
the Lamperti transformation exists. 

Though the heat kernel expansion is very well-known, the Brownian bridge rep- 
resentation (2.4) of Theorem 2 leads to a novel probabilistic derivation which we 
will now present. To fix ideas and illustrate the methodology employed, we start 
with the case of Brownian motion with drift; as before, the general diffusion case 
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follows via the Lamperti transformation. To minimize mathematical technicalities, 
we shall assume (at least in this section) that all functions are bounded with bounded 
derivatives. 


3.1 Heat Kernel Expansion for Brownian Motion with Drift 


Theorem 3 Let X, be the Brownian motion with drift h, i.e., X, satisfies the SDE 
dX; = к + h(X;, t)dt. Denote by Н an antiderivative of h with respect to x, 
namely, 2 gx H (x, t) = h(x, t), for all x and t. The transition density p of X; has, 
as t > T , the following small time asymptotic expansion: 


p*(T, уг, x) = ФТ — t, y — x) cP ODA) y (3.1) 


= 71 [= s) + hy (x5, s) + 2H, (x* s)| 
2 ; s? X Ws» tls > 


ds +O(T — 2i 


| | | ge c 
where ф is the Gaussian density $(t, £) = e 2%. x* denotes the straight line 


1 
Nn 
from (t, x) to (T, y), i.e., xf = x + T (y — x) for s € [t, T ]. 


Notice that in the time-inhomogeneous case Л = h(x, t), the approximation (3.1) 
is different from the heat kernel expansion (see, for example, (3.3), (3.6), and (3.7) 
on page 603 of Gatheral et al. [9]) in that the approximation in (3.1) involves an 
integration from ft to Т whereas, in the classical heat kernel expansion, all quantities 
are evaluated at the fixed initial time t. Of course, in the time homogeneous case where 
the drift Л = h(x) has no explicit dependence on f, the expansion (3.1) coincides 
with the classical heat kernel expansion as formalized in the following corollary. 


Corollary 1 (Heat kernel expansion for Brownian motion with drift) For Brownian 
motion with time homogeneous drift h = h(x), the transition density рХ of X, from 
(t, x) to (T, y) has the asymptotic expansion up to first order as 


p(T, ylt, х) = Ф(Т —1,у— xje VHO) 


T-t i 2 / 2 
h- f ["©+ҥ®]ж+от-|, 


which coincides with the classical heat kernel expansion up first order (see, for 
instance, Gatheral et al. [9]). 
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Proof In this case, Н, = 0 because Л, = 0. The integral in (3.1) can be evaluated as 
T 
/ [Ha $ ДЗ] ds 
t 


3 2 S ‚ = [ 
/ L (e o-o) +r (1+ 0-0) as 


== [Po +n ©] 


X 


where in the last equation we used the change of variable £ = x + = (y — x). 


Let Y, denote the Brownian bridge from x at time f to y at time Т and M yl] 
be the expectation under the Brownian bridge measure. The proof of the asymptotic 
expansion (3.1) requires the following two lemmas. 


Lemma 1 Fora bounded function g = g(x, s), |g| < M say, we have the following 
estimate 


T 
" T " 
Ex,y [е nd = 1+ f ty 190, 5)]45 + ОСТ — 1)’. 
t 


Proof The proof is based on a clever application of the convex order for random 
variables first observed, to our knowledge, in the paper by Goovaerts et al. [10] (see 
Proposition 6.2 on p. 348). Denote by Оу, ,) (9) Ше qth quantile of the random 
variable g(Y;, s). Since exponential functions are convex, it follows from Proposition 
6.2 of Goovaerts et al. [10] that 


1 
E, , [e 900] Е f elt Qos Dds dq, 
0 


We establish an upper bound for the right hand side. First we Taylor expand the 
integrand and rewrite the integral as 


Lh og , © q pl T k 
eh Qs, sy (q) 544 = > Es Owy,.s)(q)ds dq. 
0 9 M Јо Mi 


k 
An upper bound for i ( i Оу, з) аз) dq is then determined as 


1 T k 
f (/ Quo. ds dq 
0 t 


1 T 
= f ra I [О су,» (4) ds 4да (by Hölder’s inequality) 
0 t 
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I 
= (Т —1)* E Ex ylg(Ys, 5)| ds 
t 


< M'(T —1)* (since |g| < M). 


Thus, 


1 T. 
n eh Og(Y;.s) (q) ds dq 
0 


1 pT © q pl T k 
=1+ Оло.) (9) ds dq + У — Oso, s (q)ds | dq 
0 Jt k=2 k! 0 t 


Т оо 
= ПИБ 
1+ / х,у 1905, 5)] ds + > uM (T – 0)" 


ІЛ 


ІЛ 


T 
1 «f £y Lgs, 5)] ds + M^(T — 14 M0 79, 
[4 


which completes the proof. 


Lemma 2 asserts that the time integral of the conditional expectation in Lemma 1 
is approximately, up to order (Т — t)*, equal to the integral along a straight line 
connecting x at time ѓ to y at time T. 


Lemma 2 For a bounded function g — g(x, s) with bounded second partial deriv- 
ative with respect to x, the following asymptotic holds. 


T T 
/ Ex y [0(У;, 5)] ds = / g(xs,s)ds + O(T — ty), 
t t 


where xs denotes the straight line x, = x + T (y — x) from (t, x) to (T, y). 


Proof Taylor's theorem implies that 


9(Ү;, 5) = д(ху, 5) + gx xs, S)(Ys — xs) + dt) (Y; nN: 


for some £, between Y, and xs. Since Y; is a Brownian bridge from (t, x) to (T, y), 
Y, is normally distributed: Y, ~ № (ss. pa. Therefore, 


Ey [g(Ys, 5)] = 905, 5) + gx is. 5) Ey [Ys — xs] + аа DE [o = x? 


2 
gsx (s, 8) (s — (T з) 


= g(s 5) + 2 T; 


Hence, by the assumption that |gxx| < К, 
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T Т Т _ _ 
f ®.ш0%. mas = f возах | 2,8 09 a, 
t 


Ў t T —t 

T К [T (s — tY(T — s) 
< ‚5)а а 
[sesion E f ®=0@=® 


т К 

2 

=) g(x,,s)ds + — (T — t). 
i 12 


The proof of Theorem 3 is now straightforward. 


Proof (Proof of Theorem 3) By combining the two asymptotics in Lemmas 1 and 
2 with g(x, 5) = h2(x,s) + hy (x, s) + 2H, (x, s), under the assumption that g is 
bounded with bounded second partial derivative with respect to x, we obtain 


Eeg le MV ы! 
1 й 2 2 
-1-5 С (ху, 5) + Ay (ху, з) + 2H, Qs, s)| ds + OCT — ty. 
t 


Recall expression (2.2) for the transition density: 


p*(T, ylt, x) = (T — t, y — x) e# 0 D-H 01) x 
E, y [e ыт ОИ | ! 


Substituting the approximation of the conditional expectation above, we obtain 
p (T, yt, x) = (T — t, y — x) e” G^ D-H6:0 y 


T 
fı- J [Pas D s. + HG ] dee OCF — 0° |. 


О 
3.2 Heat Kernel Expansion for Nondegenerate Diffusions 
For general nondegenerate diffusions, consider the process 5; driven by the SDE: 
dS, = a(S;, thd B; + (S, t)dt. (3.2) 


Again the Lamperti transformation allows us to carry over the small time asymptotic 
expansion (3.1) in x-space to s-space. Specifically, recall that the Lamperti transfor- 


mation x; = (s+, t) = / a om transforms the SDE (3.2) into a Brownian motion 
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with drift dX, = d B; + h(X;, t)dt, where h(x;, f) = (si, t) + из) — 2680 and 


that the transition densities p for S, and p* for X; are related by 


5 _ 1 X 
Dp? (GL, sr|t, s) = р QG,xrlt, xi), 
a(sr, T) 


with xr = (sr, T) and x; = (sr, t). Hence, a small time asymptotic expansion 
ast — T- for p? can be obtained by simply applying the expansion (3.1). This 
argument is formalized in Theorem 4. 

Theorem 4 The transition density p? of the process S, driven by the SDE 


dS, = a(S;, thd B; + w(S;, t)dt 


has the small time asymptotic expansion as t — Т^ 


ФТ =t, PCr, T) 7 PD) nor ry T) HGi0.0 (3.3) 


5 
T,sq|t,s;) = 
p Tt, St) als. Т) 


1 Т 
х | — al le (oe, т) +h (рт, т) + 2Н, (рт, T)d T + O(T — 2d , 


[4 


Tt 


where фу = Te. D + т; (т, T). 


We stress once again that in the time-inhomogeneous case, а = a(s, t), the 
expansion in (3.3) is not identical to the classical heat kernel expansion as it involves 
an integral along the path у. On the other hand, in the time-homogeneous case а = 
a (s), (3.3) does recover the classical heat kernel expansion. In this sense therefore, 
we have derived a natural generalization of the classical heat kernel expansion. 


Corollary 2 (Heat kernel expansion for time-homogeneous diffusions) The transi- 
tion density p? of the process S, driven by the time-homogeneous SDE 


dS, = а(5,)аВ, + u(Si)dt 
has the small time asymptotic expansion as t — Т up to first order 


WT —t, р(ѕт) — vs) o Fev T) - Hop) 
a(sr) 


p(T, sr|t, st) = (3.4) 


c= тт , ds 
х 1 7 206 = oD) [ [Peo + o ec) | aT 
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where p(s) = [is E ho (5) = EG) — 2 and H is an antiderivative of h. 


i Я = 1 
ф denotes the Gaussian density ф(1, £) = sou 


expansion coincides with the classical heat kernel expansion up to first order. 


2 
-*5:. The small time asymptotic 


Proof We verify that the expansion (3.4) is indeed the classical heat kernel expansion. 
The classical heat kernel expansion up to first order (see, for instance, Gatheral et al. 
[9]) reads in our notation 


simt Ф(Т — t, plsr) — ф(5;)) T 
i 


E [^ [ Lu(s,sr) ds | 
сз u(s,sr) а(в)]” 
[т riu) 


d 2 ; 2 os 
a(n ao. and £ = f LO a + u(s)ð; is the infinitesimal 
generator associated with the oe Sr. In this case, the asymptotic expansion (3.3) 
reduces to 


where u(s, sr) = e 


Ф(Т — t, esr) — qi) gH ov Gr)- Нор(в) 
а(ѕт) 


1 T 
x |!-;/ оо едат). 
t 


where yr = T— (st) + T- (sr). Therefore, it suffices to show that 


eHov(sr)—Hoy(s1) = и(5;, ST) (3.5) 
and 


[ Lu(s,sr) ds 
= — (sr) u(s, sr) a(s) 


1 T: 
-3 | "eoe = (3.6) 
t 


For (3.5), since h o y(s) = A) =; q'(s) = and Н is an antiderivative of 


1 
a(s) a(s)? 
h, we have 


plsr) 


Н o ф(ѕт) — Ho q(si) = rd һ()45 = i h o p(s)dy(s) 


z Be 22d ds =f” ws) | EM 
= = 5 log : 
s La(s) 2 |a Jy a(s) 2 a(s;) 


Therefore, 


5 ) 
gH oer) Hop) — ED bods | asi) 


a(sr) 
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As for (3.6), since г = T— (sr) tope), we have 


—t 


T 
J & (pz) + hl (p.d 
t 


[в ое раа 
— — (5 — — (5 — — (5 —— T 
; Tug t Fist T тор Т:Т 


jm sr 
M а 8 f PAKEA. 
A A 


Note that dy(s) = as and 


, 1 d d 
h оф(ѕ) = — [л о ф(5)] = a(s) х d | 
$ 


v (s) ds 


us) aa 
a(s) 2. |? 


consequently, 


2 / 
el Г =f fe d u d ds 
[ [Peon ne] ae = f 69 (2-5) |S 


where we suppressed the dependence оп s for notational simplicity. On the other 
hand, for the right hand side of (3.6), by straightforward calculation we have 


а? (s) 
2 


2 ^2 IN / 
Д (a) afn a ap 
l 22 8 2 (: =) T ДЕ; 


1 ш а! 2 m a’ 4 
= (2-5) +a (2-5) u(s, ST). 
It follows that 


[ 5mm ds _ T (£-2)« = ds 
s, Us, sr) a(s) 2 7 а 2 a a 2 a(s) 


a ie h' d 
=-5 | [Peon +o] ase. 


Lu(s, sr) = ди (з, sr) + u(s)Osu(s, зт) 


which completes the proof of (3.6). 
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4 Implied Volatility Approximation 


The implied volatility ogs = ops(K, T) is defined implicitly by solving the nonlin- 
ear equation 
C(s, t, К, T) = Cps(s,t, К, T, ogs(K, T)), (4.1) 


where the function Свѕ on the right hand side is the celebrated Black-Scholes pricing 
formula for call options (assuming zero interest rate and dividend yield): 


with dj = E + + 288 УГ ,d = dı —opsVT — t, and N(-) is the cumulative 


normal distribution а. The Black-Scholes formula is monotonic increasing in 
the volatility parameter ogs, and for this reason amongst others, it is often market 
practice to quote options in terms of Black-Scholes implied volatility. Moreover, 
practitioners often calibrate their option pricing models to implied volatilities rather 
than price quotes. In this regard, efficient and accurate approximations of implied 
volatility not only permit faster calibration of option pricing models but also help 
build intuition. 

Conventionally, asymptotic expansions of implied volatility for small time to 
expiry (to lowest order) are generated by matching exponents in respectively, an 
asymptotic approximation for a far out-of-the-money (OTM) option under Black- 
Scholes, and an asymptotic approximation to the option price from direct integration 
over the (approximated) density. For such far out-of-the-money (OTM) options, as 
time approaches expiry, the event that the underlying will end up in-the-money at 
expiry is a rare event. According to the theory of large deviations, such a rare event 
has exponentially small probability, so the option price is of the form 


99 de) 
i e T- f (x) dx. (4.2) 
K 


Ast — T`, ће main contribution to the integral comes from the minimum point of 
d, which in this case is the boundary point of the support of f because, in the OTM 
case, d (x) is strictly increasing in x, and f (x) has the payoff function as a factor (see 
(4.4)). To zeroth order, the Laplace asymptotic formula (for example, see (5.2.23) 
on p. 193 of Bleistein and Handelsman [3]) then reads 


«o f'(K) 


á Toa ы A^ = dm эы = 
f f(x)dx © (T —t)*e ОКУ? (4.3) 


ast — Т, provided f'(K) and d'(K) are nonzero. Thus, the small time asymptotic 
expansion of the implied volatility is obtained by applying the Laplace asymptotic 
formula (4.3) to both sides of (4.1) then matching the corresponding coefficients. 
As one might expect, the dominating term of such expansions is typically the zeroth 
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order term. Our objective in this section is to demonstrate how to implement this 
matching procedure from the path integral perspective. 


Recasting Eq. (4.1) for implied volatility using our path integral representation of 
the density, and using our earlier representation (2.9) of the call price, we obtain 


[= $т = 5 he ib Ee hos D dr DET, 
к a(Sp,T) 


cà 
= [ Sr = K аншы ыры) f e MI атра; 
K C; 


OBSST 


1 log sp —logs; | vgg V T—t 2 
= ГА = қ лаге ) dSr. (4.4) 
K 2n(T — t)ogsST 


Equation (4.4) provides an implicit expression for Black-Scholes implied volatility 
in terms of local volatility. In the foregoing, we first show how to recover from 
(4.4) the heat kernel approximations of Gatheral et al. [9] and the most-likely-path 
approximation of Gatheral and Wang [8]. Finally, in Sect. 4.3, we show how to 
improve on these approximations by adopting the path integral perspective. 


4.1 Recovery of the Berestycki-Busca-Florent (BBF) Formula 


To rederive the results in Berestycki et al. [2] and Gatheral et al. [9] from (4.4), we 
approximate both sides of (4.4) as Laplace type integrals as in (4.2). The path integral 
on the left hand side of (4.4) is approximated as follows: 


n as q^ He Thes D) dr py] 
Cy 


=} е 12-2 f hos rider Ј IG ndr] 
C, 


-1 ff ix, Par 7 T2 
x e 25 т 1—2 h(xr, T)dXr + h“ (хт, т)ат | D[x] 
Е t t 


x = “ara = ^n + O(T — 1)], 


where in the last step we approximated the path integral by evaluating the integral in 
the exponent along a ne path: the straight line connecting x; and хт. Recall that 


X; = qs; t) = fia z XED B Substitution back into the left hand side of (4.4) gives 
] xxm сш. [e be Vieh Pat рү qr 
к а(8т, T) 
00 — der. Ту бв. о? Sp — 
x 1+ O(T —1)]45т, 
" e ar, T) 5 [1+ O( t)]dSr 
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which is of Laplace type as in (4.2). Applying the Laplace asymptotic formula (4.3), 
we obtain that, up to a factor, 


СКТ) бв 02 
C(s,t, K, T) ~e UTEN y (4.5) 
Likewise, the Black-Scholes price on the right hand side of (4.4) is given, up to a 
factor, by 


= ов K Log s|? 


Свѕ (5,7, K, T) е 2850-0) (4.6) 


Finally, by matching ће exponents in (4.5) and (4.6), we obtain the zeroth order 
approximation of the implied volatility as 


.. log К — logs 
ФК, T) — qs. t) 


OBS 


In the time homogeneous case, 


d 
а) 


and we recover the ВВЕ formula as in Berestycki et al. [2] and Gatheral et al. [9]. 


K 
et) e) = J 


4.2 Recovery of the Variational-Most-Likely-Path (vMLP) 
Approximation of Gatheral and Wang [8] 


The path integral term in (4.4) is in x-space. Alternatively, in s-space it reads 


d 3T | as(sr.T) Pe 
/ е 2Jt |а(т,т) ' 2 Dis], 
Cs 


where 
1 imd ds; 


1 
D[s] 2 lim А 
п> оо A2 Ata(sp, Т) П A/2n At alsi, ti) 


Hence, we can rewrite the left hand side of (4.4) in s-space as 


| as (87,7) 2 


оо dep 3T 
C(t, st, K,T) = (sr -ю | e ih E E | рдуу, 
К C; 


The variational most-likely-path approximation of implied volatility developed in 
аз (S757) 


Gatheral and Wang [8] is obtained by dropping the second term 277^ in the path 
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integral and evaluating the resulting path integral along the path that minimizes the 
functional 


з 2 
1 fT 5 
e $ asm) | dT 


In other words, 


2 
dT 
dsr, 


зт 
a(s¥,7) 


Шш = 
C(s,t, К,Т) ~ | (sr —K)e ^" 
K 


s |a 
an| 27 
subject to the constraints that initial and terminal points are fixed at s; and sr respec- 
tively. Moreover, since the resulting integral is of Laplace type, the call price is given 
asymptotically, up to a factor, by 


where s* is the optimal path that maximizes the action functional ГИ 


2: 
| а 


C(s,t, K,T) ze 


Ж 
57 
а(ѕт.т) 


where the optimal path 5* has initial and terminal points s апа К respectively. Finally, 
by matching the exponent with the Black-Scholes asymptotic as in (4.6), the zeroth 
order approximation of implied volatility is given by 

> 4-4 

а 


Пов К — log s| ra 
OBS: == — 
BS Tot : 


which recovers the variational most-likely-path approximation of the implied volatil- 
ity presented in Gatheral and Wang [8]. 


^ 
5 T 


a(sz, T) 


4.3 New and Improved Most-Likely-Path (MLP) 
Approximation 


As is obvious from our presentation, the approximations obtained in Gatheral et al. 
[9] and in Gatheral and Wang [8] are suboptimal from the perspective of our path 
integral representation (4.4) in the sense that they both drop terms. This suggests that 
we should define the path-integral-most-likely-path to be the path that maximizes 
the full action functional 


1 Т 
5 f |2. — h(xr, l^ dr (4.7) 
t 


or equivalently in s-space the functional 
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1 T Sr ds (Sz, T) A 
=f [+ 2 | m Bn) 


without dropping terms. The Euler-Lagrange equation associated with the functional 
in (4.7) is 


Xp = АА +hy (4.9) 


with boundary conditions х; and хт at times t and T respectively. Matching exponents 
as before gives 


2 


log K — logs + opsvT —t 


T 
x* — h(xt, т)]2ат, 4.10 
AT ; n (xf, т)] (4.10) 


where x7 is the optimal path which maximizes the functional (4.7) (or equivalently 
solves (4.9)) with initial and terminal points given by (5, t) and y(K, T) respec- 
tively. Solving (4.10) for ags yields our new-and-improved zeroth order approxima- 
tion for implied volatility. 

To illustrate the accuracy of our new approximation (4.10), consider the case 
of time dependent Black-Scholes, where rather pleasingly, (4.10) gives the exact 
solution. Note in passing that, to the best of our knowledge, none of the existing 
small time approximations is able to recover this very simple case. 


Example 1 (Implied volatility in the time dependent Black-Scholes model) Assume 
the price S, of the underlying satisfies the following under the pricing measure: 


dS, = о(т) S- d Bz, St = 5. 


In order to apply (4.10), we proceed as follows: 


(a) Transform the model into x-space. 

(b) Solve the Euler-Lagrange equation (4.9) for the optimal path. 

(c) Evaluate the the action functional (4.9) along the optimal path, substitute into 
(4.10) and solve for the implied volatility. 

(a) Transform into x -space: In this case, x = p(s, t) = Н sett = Dele 
Dropping the explicit dependence on f for ease of notation, and applying Ito's 
formula to X; = y(S;, t) we obtain 


dX 


- 


. 1 
= (Si, t)dt + vs(Si, t)d S, + Б Pss CSi t)d[S]; 


/ 
c 2 


Thus h(x, t) = —2 — £x. 
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(b) Solve the Euler-Lagrange equation: The associated Euler-Lagrange equation 
(4.9) in this case reads 


me [(@-@]- 


With the change of variable x — sd the above ODE for x is transformed into the 
following ODE for z 


20! а (Zz 
20520 Е 
е p dT (2) 


With boundary conditions 2; = о; х; and zr = от хт, the solution to the Euler- 
Lagrange equation is given by 


# 
OT XT — Ot Xt 
Or Xr = Zr = орх + cj o^ (s)ds. 
t 


JE a? (s)ds 
(c 


— 


Solve for implied volatility: It follows that. the functional (4.7) evaluated along 


the optimal path, taking into account that 4 = poc is a constant, is given 
" с=(5)а5 
by 


T T 
n lx, неа 
t 

T T 
x 2004, | ат= f 
t 


2 
i — CINE а 1 И 2 
x с=т 
T o2 (s)ds #3 
1 —1 
zd MES SEU vf c? (s)ds 
JSE mom 


Finally, substituting this last expression into (4.10) gives the well-known result 


2 
ат 


с! 
#+у+—х 


+ 2 


2 1 T 2 
ops = 7 | с“ (s)ds, 


which is exact. 
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5 Conclusion 


We have shown, up to first order in 7 — T — f, that the classical heat kernel expan- 
sion can be derived using a novel probabilistic approach. This new probabilistic 
derivation of the heat kernel expansion inspires a path integral representation of the 
transition density; natural definitions of the most-likely-path approximation of the 
transition density, the call price, and the implied volatility then follow. In the time 
homogeneous case, we recover well-known classical results. However, in the time 
inhomogeneous case, we obtain a new asymptotic expansion that generalizes the 
classical one. We showed how the lowest order approximation of Berestycki, Busca 
and Florent as well as the higher order approximations of Gatheral et al. [9] and 
Gatheral and Wang [8] correspond to dropping terms in our lowest order path inte- 
gral representation. We further showed that by restoring the dropped terms, our new 
representation recovers the exact expression for Black-Scholes implied volatility in 
the time-dependent Black-Scholes model, which no existing asymptotic expansion 
technique has so far been able to achieve, to the best of our knowledge. Further appli- 
cations of this promising approach to the important practical problem of accurately 
approximating implied volatility under local volatility is left for future research. 
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Extrapolation Analytics for Dupire's Local 
Volatility 


Peter Friz and Stefan Gerhold 


Abstract We consider wing asymptotics of local volatility surfaces. While our 
recent paper in the journal Risk (De Marco et al. Risk 2:82—87, 2013, [3]) discusses 
our approximation formula from a practical and numerical perspective, the present 
paper focuses on rigorous proofs of the approximations. We apply the saddle point 
method (Heston model) and Hankel contour integration (variance gamma model). 


Keywords Local volatility + Saddle point methods * Contour integration 


1 Introduction 


One of the main objectives in option pricing theory is to price exotic derivatives con- 
sistently with observed vanilla prices. According to the seminal work of Dupire [5], 
this can in principle be achieved, for a one-dimensional underlying, by a model with 
dynamics dS,/S; = o(S;, t)dW;. As opposed to stochastic volatility models, here 
the volatility is a deterministic function of time and current underlying price. Any 
given smooth call price surface C(K, T), for strikes K > 0 and maturities T > 0, 
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can be recovered by a so-called local volatility model dS;/S; = oc(S;, t)d W;, 
where the volatility function is given by Dupire's formula [5] 


20тС 


2 
eis CK, T) = bere 


(1) 


Exotic options can then be priced by Monte Carlo simulation. Local volatility models 
are of considerable practical importance, and serve as building blocks for more 
advanced models, e.g. local-stochastic-volatility (LSV) models. 

In the present paper, we consider local volatility surfaces that arise from call 
prices that are generated by some model for the underlying. Our aim is to turn the 
knowledge of that model's mgf (moment generating function; of log-spot X7) into 
asymptotic results of the corresponding local volatility surface. In [3], we described 
two applications of such approximations. One is to the design of local volatility 
parametrizations, whose asymptotic behavior may be matched to our results. Another 
application concerns model risk. Consider pricing under an “advanced” model (affine 
stochastic volatility, Lévy, etc.; anything with known mgf) versus a local volatility 
model. The relative differences between the prices has been named “toxicity index" 
in [13]. Roughly speaking, it measures the distance of the trade from vanilla options. 
The most consistent way to calculate this index is to use the local volatility model 
generated by the “advanced” model, because only then all vanillas will have zero 
toxicity. When computing the local volatility surface, our accurate approximations 
can then profitably replace other numerical methods in regimes where the latter 
become unstable (see [3] for details). 

We suppose that the underlying price process S; = exp(X;) is a martingale under 
the pricing measure P and write C(K, Т) for its call price surface. For simplicity we 
assume zero interest rate throughout. If C is sufficiently smooth, then the associated 
local volatility function is given by Dupire's formula (1). Recall the main asymptotic 
formula from [3]: 
2am (s, T) 


s(s — 1) s=3(k,T) 


Cine (K, Т) ~ (2) 


where k denotes log-strike, and $ = $(Kk, Т) is determined as solution of the saddle 
point equation 


д 
9," 9 T) =k. (3) 


Here, m(s, T) := log M(s, T) is the logarithm of the moment generating function 
(mgf) M, which is defined by M (s, T) := Eexp(sX7) and is analytic in the (max- 
imal) strip s- (T) < Re(s) < 5- (Т). The numbers 5_ and s+ are called critical 
exponents. In this note, we will use (2) for K — oo, but other asymptotic regimes 
can also be covered [3, 8]; it is thus not only a local-volatility analogue of Lee's 
moment formula [11], but works also for maturity (or joint) asymptotics. 
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As described in [3], formula (2) results from saddle point approximations of 
numerator and denominator of Dupire's formula, after inserting the Fourier repre- 
sentation of the call price: 


згрс — 2 sity, Ee De M(s, T)ds 
К2дккС — Ги е—%5 M (s, T)ds | 


[10,0] 


oi (K, T) = 


(4) 


(The real parts of the contours are in (1, s+).) 

Whereas the focus of [3] is on numerical tests and applications, the present note 
gives proofs for the validity of (2), in the setting of the Heston and of the variance 
gamma model. As regards methodology, the proof for the Heston model uses a 
classical saddle point approach. Its most interesting ingredient, similarly to [7], is 
the use of ODE comparison results to furnish the necessary tail estimates, without 
taking recourse to the explicit form of the Heston mgf. The analysis is thus well 
suited to extension towards other affine stochastic volatility models. For the variance 
gamma model, the saddle point method is not appropriate. We apply another classical 
contour integration approach, based on Hankel contours, which seems to be new in 
mathematical finance. 


2 The Heston Model 


Even though practitioners seem to prefer local-stochastic-volatility models nowadays 
over the classical Heston model, it might still be useful for the two applications 
outlined in the introduction (model risk and parametrization design; recall that the 
large maturity Heston smile motivates the popular SVI parametrization of implied 
volatility [9]). The dynamics of the Heston model are 


dS, = S /Y,dW,, 80 = so > 0, 
dV, = (a -bV)dt -e/ VjdZ,, | Vo vo > 0, 


with a > 0, b < 0, c > 0, and d (W, Z), = pdt with p € (—1, 1). 


Theorem 1 7n the Heston model with p < О (the relevant regime in practice, at least 
for equity models), the asymptotic equivalence (2) holds for k — оо. The explicit 
leading term is 


2 
2 
K,T)~ k, К oo, 5 
kei = DEUS ^ 5° x 
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where К = log (K/So), s+ = s. (T) and 


Ку = Ts (s4 — 1) [22s — 1) — 2pc(s, pc + ь)| (6) 
— 2(s., pc + b) 5 (2s. — 1) = 2рс(з+рс + Д 
+ 4рс[ 256. — D — вурс Б)? ], 


Кз = 2с?з+ (s, — 1) [ча — 1) — (spc + b) | (7) 


Proof It was shown in [3] that the right hand side of (5) asymptotically equals the 
right hand side of (2). It thus remains to show that (2) holds for the Heston model as 
k — oo. 

By the exponential decay of the Heston mgf towards +іоо, the second equality 
in formula (4) is correct for the Heston model. For the saddle point analysis of (4), 
we employ the approximate saddle point 


Sapprox (К) = S4 — pk te. 


where 8 — - E ‚ € denotes the critical slope 
OT* 
e(T) = =- GT). 
Ky 
and 


T*(s) = sup(t > 0: Efe’*'] < оо}. 


This is the same approximate saddle point as in [7]; see there for more details on its 
choice, and the definition of c (T) and T* (s). (In [7], our Sapprox was called simply 5, 
since the exact saddle point of the denominator of (4), defined in (3), did not occur.) 
This approximate saddle may be used for both integrals in (4). As forthe denominator, 
this was carried out in detail in [7], where an expansion of the Heston density Oy g C 
was determined. The analysis of the numerator in (4) is similar, except that a new 
tail estimate is required. But first we discuss the local expansion around the saddle 
point. Let us fix a number a € (2, 3) and define h(k) = k~®. Then, in the central 
range |$ — Sapprox(k)| € h(k), we have 


AN 1 
s(s—1) wu 


= -= (1-097) 


+ O(s, — s) 
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and (cf. formula (19) in [3]) 


д 28° 1 
2— Т) = ———2s 
or" € ) o(s4 — s)? T (=) 


2 

= 29 (1 + 0079? 071) 
с 

= “(+ ogo), 


oO 


Therefore, the local expansions of the two integrands in (4) agree, up to a factor that 
is given by 
20тт(ѕ, T) _ 2k 


s(s—1) _ os, — 1) 


(1+ 0 (1/279), (8) 


where the error term holds uniformly w.r.t. the integration variable s. According to 
Theorem 1.2 of [7], we have 


ji Sapprox +ih(k) e ; А 2 
= | e 5M(s, T)ds ~ Алей A3)k+Aavk p 3/4+а/с (9) 
Um Sapprox —i A (Kk) 


for certain constants А, A» = 20, and Аз = s+ + 1. Analogously, we derive from 
(8) that 


1 Sapprox tih(k) 20 m(s, Т 
2 Zorms, T) ts pats. T )ds 
UT 


(10) 
Sapprox — iA (K) 5 (s m 1) 


2k 
os+(s+ — 1) 


х A,e Ay) rA k p—3/4+a/c7 | 


Dividing (10) by (9) shows our claim (5), provided that the tails |s — Sapprox (k)| > 
h(k) of the integrals can be discarded. For the denominator of (4), this was shown in 
Lemma A.3 of [7]. So we proceed with the numerator. We consider only the upper 
tail, as the lower one is handled by symmetry. By Lemma A.3 of [7], there is a 
constant B > 0 such that 


Sapprox +i В k 
n e “М, T)ds 
Sapprox tih (k) 


< e(1743)k exp(A2 Vk И 38 pr ve ES O(logk)). 
(41) 


From formula (18) in [3] we obtain 


Orm(s, T) 
s(s — 1) 


< const x К 
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for all s on the contour in (11). This estimate can be absorbed into the factor 
exp(O (log k)) in (11), so that we conclude 


D Orm(s, T) 


e 5 M (s, T)ds 
Sapprox КЇЛ (Kk) s m 1) 


< e0 7 АЗ) exp( A; /k — 1g y 2-20 + O(log &)). 
(12) 


This grows slower than the right hand side of (10) (compare the relevant factors 
к—3/4+а/е resp. exp(—3 07 1k3/2-2e)), As for Im(s) > B, it was shown in [7] 
(Lemma A.2) that 


Sapprox +100 
n e М(5, T)ds| = O(exp((1 — A3)k + BA/K)). 


Sapprox +1 B 


This was deduced from the exponential decay of M (s, T) for large Im(s) (Lemma 
A.1 in [7]). The following lemma implies that the new factor Orm(s, T)/(s(s — 1)) 
grows only polynomially, so that the exponential decay of the integrand persists for 
the numerator of (4). This finishes the proof of Theorem 1. 


To state the lemma, recall that m(s, t) = (s,t) + vov(s, t), where ф and w 
satisfy the Riccati equations 


ó-—av, #0) = 0, 
p= $s? з) + асу? +by+spcp, 00) = 0. 


We have to show that m grows only polynomially as Im(s) — оо. Because of 
the Riccati equations, it suffices to show this for v». Let us write i = f + ig and 
s=€+iy. 


Lemma 2 Let T > 0, and assume that the real part © of s stays bounded in some 
interval 1 < © < Emax. Then, there are positive constants С; т (i = 1, 2, 3) such that 
for у > yo, where yo depends only on Emax and the other (fixed) model parameters 
of the Heston model, 


-—C3,ry’ < f(t) < -Cury, 
0 < g(t) < Соту. 
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In fact, we can take 


Сут = 1/ (Зс), 
1 
Сәт = 2 (max = 1) Т, 
2 
C 2 
Сз, т = Т (: + Schr) * 


Proof It follows from the proof of Lemma A.1 in [7] that (e.g. with Ст := T0 = 
1 1 

zA/1/6 < <) 

с = 3e 


1 1 
Ра) < –ТӨу = Tzv 1/6y < ur ia —С\ ту. 


We next provide a similar upper estimate for g. To this end we first show that g = g(t) 
remains > 0 for all times / > 0. The differential equation for g, 


1 
ja Ry =y cfg-—^4g,  g(0)- 0, 


implies the first order Euler estimate 


1 
90) = 90) +15 Q£y - + c? f (09(0) — 7900) | t+ olt) 


1 
= (24у – у) +0(0), 


—— 
>0 


and hence g is positive (even strictly so) on some interval (0, =). Assume this interval 
is maximal in the sense that g(€1) = 0 and g is (strictly) negative on some further 
interval (£1, €2). Clearly then (=) < 0, which contradicts the information from the 
differential equation: indeed, using g(e1) = 0, we obtain the contradiction 


1 
gle) = 5 (26у — y). 
— 
>0 
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The observation that g > 0 is useful to us, since it leads, together with f < —Ci,ry 
and y > 0, to the differential inequality 


EE 
= 5 06у – у) + с 9 – үд 


: (2£y — y) — (ecir + л) 9 


ІЛ 


— М 


2, oy): 


and hence to the upper estimate 
1 
VO <t <T: g) < , Chu DT x у =: Соту. 


We can feed this upper estimate on g back in the differential equation for f to obtain 
a lower estimate 


*=;(ё y? )+5 (e g)-af 

> (2 у? )+©/ суу vf 
=-5 (1427) +5(C ©) TE 
>- (1+ 26%) +i (E-i) -A 

>- (1458r) = ү}, 


where іп the last step we assume that у is large enough so that the extra amount 
subtracted (at least: 1 y?) is larger than 1 G m ©), which remains bounded. We also 
know that f(t) € Суту < Oforall 0 < t < T. It follows that —yf > 0 апа 
omission leads to our final lower bound on f, namely 


j2-(i sa ) F 
= о 2737 + 


This entails immediately 


2. 
C 
Ра) > -T ( + Scr) y? =: =Сзту?. 
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3 The Variance Gamma Model 


The mgf of the variance gamma model is 
M(, Т) = e!(1— 8ys — loys? TM, 


where c, v > O and 6 € R. The “drift” b = v-! log(1 — 0v — 1o?v) is chosen such 
that S = eX becomes a martingale (w.l.o.g., $0 = 1). For fixed T withO < T/v < І, 
the density Oy к C(K, T) of Sr has a singularity at the origin. Indeed, it behaves as 
~ |k|T/v—1. which easily follows from the integral representation of the density [1] 
(as always, k — log K). At the money, the denominator of the Dupire formula (1) 
thus explodes for small T. If T/v > 1, then the density is continuous. This lack of 
smoothness is just an additional issue on top of a common feature of jump models: 
The associated local volatility surface explodes as T — 0, and so the local volatility 
SDE 


48/5 = oic(S, t)aW (13) 


does not make sense on [0, oo) э f. 

However, following [8], we can start a Monte Carlo simulation of (13) at a time 
To > 0 (here, То > v/2) instead of time zero. With the appropriate stochastic initial 
value, sampled from the density Oy x C(K, To), we recover call prices from time То 
on. (То is called = in [8].) This gives a meaning to the local volatility surface of a 
jump model, without appealing to the practically challenging approach of local Lévy 
models [2]. Our aim is not to make this fully rigorous for the variance gamma model 
(or other jump models), which would require to show that (13) admits a unique strong 
solution on [70, oo). Our focus, instead, is on a rigorous proof that (2) is valid in 
this setting. To ensure the validity of the Fourier representations of density and call 
price, we even assume Т / > 1 (instead of T/v > 2) 


Theorem 3 /n the variance gamma model, formula (2) holds for k = log К — oo. 
The explicit leading term is 


2log(k/ T) 
vs4(s¢ — 1)’ 


ez (E, T) ~ k — oo. (14) 


Note that the numerator of (14) is ~ 2 log К. We kept the T-dependence, because 
the same analysis works for fixed k and T — 0, апа in fact for any asymptotic regime 
with k/ T — oo. This is a common feature of Lévy models, since the right-hand side 
of (2) depends on К and T only through К/Т. 


Proof We write the moment generating function as 


M(s,T) = e (S07 (54 —s)(s — S 
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where the critical moments are 


—v0 + 42vo? + 1202? 
7 


VO 


We analyze the denominator of (4), i.e., the density. The arguments for the numerator 
are analogous (see below). The shift k — k+ bT makes it clear that we may w.l.o.g. 
assume that b = 0. The main part of the saddle point equation (3) is T/(v(s.- — 5)) = 
k, and so 


А Т -2 
#=з+— 7 +O ). 


The saddle point approximation of the density then is 


ES ioo 2 Ms Туду eo exp(m(S, t) — ks) 


2іт J ioc V 2nm"($, T) 


The interesting point now is that (15) is wrong for the variance gamma model, 
inasmuch as asymptotic equality does not hold. The algebraic singularity of the mgf 
is not pronounced enough to make the saddle point method work; see also the remark 
after the proof. For a correct analysis, we use an integration contour as in Fig. 1. The 
U-shaped notch, denoted by C(K), extends a bit to the right of the singularity s+, 
and captures enough asymptotic information from it. By transformation into a so 
called Hankel path, Hankel's representation of the Gamma function can be invoked 
after termwise integration of a local expansion. This *Hankel contour approach" is 
well known in analytic combinatorics, in particular, from the so-called singularity 
analysis of generating functions [6]. 

Let us first argue that the integrals over the dashed lines in Fig. | can be discarded. 
By symmetry, it suffices to consider the upper one. The real part of s is then Re(s) — 
s+ + (log к) / k. First suppose that s is away from the singularity, say Im(s) > 1. The 


(15) 


Fig. 1 The contour C(k), a Im(s) 
small notch embracing the n ^ 
critical moment s+ los k | 
ogk | 
kj 
E] 
— 
1 
k 
H Re(s) 
5+ 
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integral of (5+ — s)(s — s_))~"/” over this part of the contour is O(1), and so we 
get the bound O(e7*8*)) = О(е—®*+ / К). Now consider s with 1/k < Im(s) < 1. 
We estimate the resulting integral by the length of the contour, which is O(1), times 
the absolute value of the integrand at the lower endpoint s = s+ + (log k)/ k +i/k. 
The latter is easily seen to be О (e^ + kT/v-! (log k)- T/v), 

We will now show that the integral over C (k) is of order e^ + kT/"-!, so that the 
tail estimates we have just derived are good enough. The factor (s — s_) is locally 
almost constant; we have, uniformly for s € C(k), 


M(s, T) ~ es. = 8) T", К oo, 


where с = ci(T) = (o?v(s,. — s_)/2)~"/”. Therefore, 


—ks —ks СІ 
e ^ M(s, T)ds ~f gl ds, 
|" C) (sz — s)T/v 


The change of variables s = 5+ — w/k transforms this into 


—ks4. k T/v —ks, 
а / есі (=) dw= отт | ew!’ dw 
k Уң w gt ang 


e ks 
айт | e"w TI qw. 
k ^H(oo) 


The integration paths are displayed in Fig. 2. The right one, H (oo), is called a Hankel 
contour; H (k) is a Hankel contour truncated at Re(s) = —log k. Now recall Hankel's 
representation for the Gamma function [12]: 


Im(s) Im(s) 
log k 
1 1 
- Re(s) - ‚ Re(s) 
H(k) ^ (oo) 


Fig.2 The integration contours H(k) and 4 (co). The dots should indicate that the contour H (o0) 
extends to —co 
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1 n d ae 1 
—— ew “dw = ——. 
2i JH Г (2) 
We thus arrive at 

1 ioo 


с e 5 M(s, T)ds ~ ебе T Iv (16) 
2iT J ioco Г(Т/ь) 


The numerator of (4) can be treated analogously, with a very similar tail estimate. 
The contribution of the new factor to the local expansion is 


Orm(s, T) " 2/v lo 1 
s(s — 1) s4(s_ — 1) d 
2/v 
= log — 
S4. s 1) 
2logk 
иѕ (s. — 1)’ 
and so 
ioo 9 Т 2100 к 
jj Огт T) t Mes, T)ds ~ ic x —l g-begT/v-l (47) 
ioo $(s — 1) иѕ+ (5+ = 1)  F(T/v) 


Dividing (17) by (16) yields the desired result. 


As mentioned in the preceding proof, the saddle point formula (15) is not an 
asymptotic equivalence for the variance gamma model. But, as we have shown, our 
formula (2) is still correct. What happens is that (15), and its counterpart for the 
numerator of (4), are almost correct: They are only off by a constant factor. (This 
phenomenon has already been observed for similar integrals in [4].) This constant 
factor is the same for both integrals, and thus cancels in the quotient (4). Therefore, 
our asymptotic formula (2) extends well beyond models where the saddle point 
method is applicable. In fact, we conjecture that the formula holds whenever the mgf 
explodes close to the singularity s+. 


4 Other Jump Models 


Without giving proofs, we briefly discuss local volatility asymptotics for two other 
jump models. The mgf of Kou’s double exponential Lévy jump diffusion is given by 


а252 Лр A-(1— p) 
M(s,T) = T À 1 : 
(ть нА „ЖЕР JJ) 
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The critical moment is 5+ = A, and the saddle point is located at 


А [AA4 pT 
S 25. — as c 


The singularity type, the same as in the Heston model, is amenable to the saddle 
point method. Formula (2) can thus certainly be verified, and yields 


2 Ар n 
JATA,- 1) 


For T — 0, the blowup of local volatility is of order T !/2. (Just as the Hankel 
contour analysis in the proof of Theorem 3 can be carried out for any asymptotic 
regime with k/T — oo, the same is true when applying the saddle point method to 
the local volatility surface of a Lévy model.) 

Finally, we consider the normal inverse Gaussian (NIG) model. The mgf 


оК, T) ~ , k— oo. 


M(s, T) = exp (Tos +6Т (Vo? — f? — Jo? — (B + s) 
has no blow-up at the critical moment 
S+ =Q B, 


but a square-root type singularity, with local expansion 
M (s, T) = еЇ?з++%ТА/о?—8 (1 —6ТУО se 5) | (18) 


It is still true that ok , T) asymptotically depends, via (4), on the local behavior 
of M (s, T) near s}. However, the approximation (2) hinges on the first term of the 
local expansion of M (s, Т). It therefore fails to capture the asymptotics of Ur. (К, Т) 
here, which depend on the first singular term (the term ys} — s in (18)). The NIG 
model is thus one of the few examples where (2) is wrong. (It gives the qualitatively 
correct result of convergence to a constant, but a wrong one.) The Hankel contour 
analysis in the proof of Theorem 3 can be adapted to handle this situation. The result 
is that local volatility tends to a constant for k — oo. This fact may be understood 
by comparing the NIG marginals with those of Heston’s in the time T — oo regime 
(this link is made precise in [10]). In particular, the result is then consistent with the 
Heston asymptotics (5) of local vol, given that the О (k) term carries a factor ~ 1/T 
which tends to zero as T — oo. 
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The Gártner-Ellis Theorem, 
Homogenization, and Affine 
Processes 


Archil Gulisashvili and Josef Teichmann 


Abstract We obtain a first order extension of the large deviation estimates in the 
Gürtner-Ellis theorem. In addition, for a given family of measures, we find a spe- 
cial family of functions having a similar Laplace principle expansion up to order 
one to that of the original family of measures. The construction of the special fam- 
ily of functions mentioned above is based on heat kernel expansions. Some of the 
ideas employed in the paper come from the theory of affine stochastic processes. 
For instance, we provide an explicit expansion with respect to the homogenization 
parameter of the rescaled cumulant generating function in the case of a generic 
continuous affine process. We also compute the coefficients in the homogenization 
expansion for the Heston model that is one of the most popular stock price models 
with stochastic volatility. 


Keywords Affine process * Large deviation principle * Heat kernel expansion - 
Short time asymptotics * Laplace method + Small maturity limit in affine models 
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1 Introduction 


The large deviations theory has found numerous applications in mathematical finance 
(see, e.g., [19]). For instance, using the methods of the large deviations theory, one 
can estimate various important characteristics of financial models such as tails of 
asset price distributions, option pricing functions, and the implied volatility (see, 
e.g., [7-11, 13, 15] and the references therein). A popular source of information on 
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the large deviations theory is the book [4] by Dembo and Zeitouni. A useful result 
in the theory is the Gártner-Ellis theorem (see [6, 12], see also [4]). This theorem 
allows to infer the upper and lower estimates in the large deviation principle knowing 
the properties of the limiting cumulant generating function. 

We will next provide a brief overview of the contents of the paper. In Sect.2, a 
new notion of Laplace principle equivalent expansions for families of functions and 
measures is introduced. This notion is motivated by the homogenization expansion 
of the rescaled cumulant generating function associated with an affine stochastic 
process X, that is, the function A defined by 


A(e, и) = €log :[ехр {-=Xe} | = clog f exp [-2z] pe(dz). 


Actually, the homogenization expansion mentioned above is nothing else but the real 
analytic expansion of the function A with respect to the parameter є (see Sect. 4). 
In Sect.3, we gather definitions and known facts from the theory of general affine 
processes, while in Sect. 4, the homogenization procedure is described in all details 
for continuous affine processes. The main general results obtained in the paper are 
contained in Sect. 2 (see Theorems 2.4 and 2.7). Theorem 2.4 states that for any family 
of measures on the real line, satisfying the conditions in the Gartner-Ellis theorem, 
and such that the homogenization expansion exists, we can find a special family of 
functions that is Laplace principle equivalent to the original family of measures. The 
structure of the function family in Theorem 2.4 resembles the first two terms in the 
heat kernel expansions on Riemannian manifolds (notice that we face a degenerate 
situation here, so we could not apply heat kernel expansion directly). Theorem 2.7 is a 
generalization of the Gártner-Ellis theorem. It is shown in Theorem 2.7 that under the 
same conditions as in Theorem 2.4, the first order large deviation estimates are valid. 
Finally, in Sect. 5, we compute the coefficients in the homogenization expansion for 
the correlated Heston model that is one of the most popular stochastic stock price 
models with stochastic volatility. 


2 Distributions with Equivalent Laplace 
Principle Expansions 


Laplace's principle is an asymptotic expansion technique, which allows one to 
approximate integrals of the form 


b 
(oes |- 29 |: (2.1) 


as є — 0. We will next formulate a rather general version of Laplace’s principle that 
will be used in the sequel. Suppose the following conditions hold: 
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The functions f and ¢ in (2.1) are continuous on the interval (a, b), and the integral 
in (2.1) converges absolutely for all 0 < є < єр. 

The function ¢ has a unique absolute minimum that occurs at z = zo with а < 
zo < b. 

The function ¢ is strictly convex in a neighborhood of zo. 

The function ¢ is four times continuously differentiable in a neighborhood of zo, 
and 


4 
ФО) = $0) + >) 


n=2 


д" 
a (z 20) + О (с = z") (2.2) 
n 


as Z — 20. 
The formula in (2.2) can be differentiated. More exactly, the condition 


А Е 4 д" ozo) 2-14 0 4 "T 
OO = У ccm" tO (0), zo: 03) 
п=2 


holds. 
The function f is twice continuously differentiable in a neighborhood of zo, and 


2 
д" 
f) => e-z" c 0 (e-z) (2.4) 


n=0 
as 2 — Zo. 


Then, as є > 0, 


roep|-* | a 


Е (zo) OF (zo) . S(059(z9)) f(zo) 
Le | c Rese 5l ta) + (Sage + 24002900)? 
9 660) Go) осоо) " o(?)| 

8(92ф(20)) 2(002ф(20)) 


b 


(2.5) 


Formula (2.5) can be derived by following the proof of Theorem 8.1 in [18]. 
Let us next assume that weaker differentiability restrictions than those listed above 
are imposed on the functions f and ¢: 


e The function ¢ is twice continuously differentiable in a neighborhood of zo, and 


у= ий eub 207 с 20)? + 0 (& — ao?) (2.6) 


as z — 20. 
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e The formula in (2.2) can be differentiated. More exactly, the condition 


OH) = O° b(z0)(z — zo) + О (e — zo”) as z — 0 (2.7) 


holds. 
e The function f is such that 


Ра) = fo) + О (z— zo) as 72> zo. (2.8) 


Then, as є > 0, 


b 
/ f(z) exp |- ФО) | dz — exp | Pen) | 06 [лсо +0 o]. (2.9) 


e O? Q(zo) 


Remark 2.1 Using the Taylor formula, we see that (2.2), (2.3), and (2.4) hold pro- 
vided that the function f is three times continuously differentiable and the function ф 
is five times continuously differentiable near zo. Similarly, (2.6), (2.7), and (2.8) hold 
if f is continuously differentiable and ¢ is three times continuously differentiable 
near zo. 


Let p = {рє}, be a family of probability measures on IR. The following assump- 
tion is modeled on the behavior of the family of moment generating functions of the 
affine process and on the homogenization ideas (see Sect. 4 for more details): 


AO (u) 


[ow Б pe(dz) = exp ( ) exp (А w) (: + єл (y) + oi) 


(2.10) 


as є > 0, where AU, 0 < k < 2, are continuous functions on the domain 7. The 
big О estimate in (2.10) is uniform on all closed intervals contained in /. 

It is not hard to see that the functions A“, 0 x i x 2, in (2.10) can be recovered 
from the following formulas: 


и 
AO (y) = lim clog | exp {-=2| p«(dz), (2.11) 


(0) 
exp [A00] = lim ш f of- pe(dz), (2.12) 
c0 € R € 
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exp faa] AO (u) 


(0) 
= шп 1 СЕ ш) |/ ехр [-54] pe(dz) — exp [sw]. . (213) 
e>0 € € R € 


It will be assumed throughout the rest of the paper that the conditions in the 
Gürtner-Ellis theorem hold. More precisely, we suppose that the following are true: 


e The function A) defined in (2.11) exists as an extended real number for all u € В. 
We denote by / the maximum open interval such that the number A ( (и) is finite 
for all u € I. 

e The point и = 0 belongs to the interval 7. 

e The function A® is continuously differentiable on Z, the derivative ð, AO is a 
strictly increasing function on 7, and the range of the function д, A® 15 R. 


The previous restrictions concern only the function A). By the Gartner-Ellis 
theorem, they imply the validity of the large deviation principle for the family p. 
More information on the Gartner-Ellis theorem can be found in [4]. The existence 
of the functions Л) and ЛО) (these functions are determined from (2.12) and 
(2.13), respectively), signals that certain refinements of large deviation results may 
be possible. 


Remark 2.2 In the paper [16] of Jacquier and Roome, an assumption similar to that 
in (2.10) is imposed on the rescaled cumulant generating function (see (2.1) in [16]). 
Moreover, there are more similarities between the assumptions in the present section 
and those in Sect.2 of [16]. Note that the main results obtained in [16] concern the 
asymptotic behavior of forward start options and forward smiles. 


The function A is strictly convex on 7. Let us define an appropriate Legendre- 
Fenchel transform of A), more precisely, we put 


* 
[a] ()- —inf(uz- AW), єй. 


It is clear that there exists a unique minimizer z +> u*(z) in the problem described 
above, satisfying the condition 


ð AO (u* (z)) = —z. (2.14) 


It follows that " 
[a8] (z) = —zu*(z) — AO (u* (z)). (2.15) 


Since A (0) = 0, we have [A] (2) > 0. It is well-known that the function 
[A9] is strictly convex on IR. The previous statements, (2.14), and (2.15) imply 
that [А @]* (z) = 0 if = —8, A (0), and [А @] (z) > 0 if z # —0,A (0). 
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d(z) = J2 [A9]* (о). (2.16) 


d?(z) u (0) * 
= [А | (о). (2.17) 


Next, set 


It is clear that 


Therefore, 


d(z) = /-2 [zu*(z) + A (u*(z))]. (2.18) 


By the strict convexity of the function A), 


inf 
ze 


| 4? (©) 
2 


uz+ 2] = —A® (u), uel. 


Let p be a family of Borel probability measures satisfying condition (2.10). Our 
next goal is to find a special family of functions Ё = { f-}-.¢9 on IR, for which the 
asymptotic behavior of rescaled moment generating functions resembles the behavior 
described in formula (2.10). It would be tempting to try to find an appropriate family 
f among the families of functions satisfying the following condition as e — 0: 


[ew |] fe(z)dz = a = | exp [A900] 


x (1 +eAVu) +0 (-)) (2.19) 


uniformly on compact subintervals of 7, where the functions AGO,0 < k x2,arethe 
same as in (2.10). However, we can not always guarantee the existence of the integral 
on the left-hand side of formula (2.19) due to the lack of control of the tail-behavior 
of the function fe. The remedy here is to localize the condition in (2.19). 


Definition 2.3 Let p be a family of Borel probability measures such that (2.10) 
holds. We say that a family f of continuous functions on IR is Laplace principle 
equivalent up to order 1 to the family p provided that the following conditions hold: 


(i) For every n > 1 there exists a proper open subinterval J, C J of the interval 7 
such that as є > 0, 


/ "exp [s] fe(z)dz = exp | С, | ехр faa] 


(1 + eA (и) + Onu (°)) 


for all u € Jy. 
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(ii) The sequence of intervals Jp, n > 1, is increasing and Ux | Jn = Г. 


The next statement explains how to construct the family f. The ansatz, defining 
the structure of the function f. in formula (2.20), is based on the classical theory of 
heat kernel expansions. 


Theorem 2.4 Let p be a family of Borel probability measures on R satisfying (2.10), 
and suppose the conditions in the Gürtner-Ellis theorem hold. Suppose also that the 
function A® is five times continuously differentiable on I, the function АЧ) is 
three times continuously differentiable on I, and the function AO is continuously 
differentiable on I. Define a family f of functions as follows: 


1 | d? (z) 
ех 
му 27є Р 2є 


where d is given by (2.18), 


Co) = YAOU) exp [A i^y]. 


fz) = 


| (Co(z) + €Ci(z)), є> 0, (2.20) 


апа 


Col) 2AM (и* (2) 5Со(о) [BAO qui») 


Ci(z) = Coz) AO (u*(z)) 


2 24 [02 А ®(и*(с))] 
Co(z) (3 [BA qu*(z)) — 2A (u*(z)) a4 AO i^ C) 
+ 
8 [02A 9 ue] 
8Co(z) BAO (u* (z)) 


202 AO (u* (2)) 
Then the family f is Laplace principle equivalent up to order 1 to the family p. 


Proof 'The differentiability restrictions on the functions AO0,0 «i < 2, in the 
formulation of Theorem 2.4 are imposed because otherwise the functions Co and 
C, are not defined. Note that the function z +> u*(z) is three times continuously 
differentiable on the real line. The previous statement easily follows from (2.14). 

The proof of Theorem 2.4 is based on the following construction, which uses 
Laplace's principle. For every n > 1, we have 


f eol) поа: 


n 


n 2 
exp [-: (« + 2:3] (Co(z) + €C1(2)dz. (2.21) 


1 
A 27€ 
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Set 
d^(z) 


Qu(z) = uz + ? 


(2.22) 


Laplace's principle will be applied to the family of integrals appearing on the right- 
hand side of (2.21) twice. The first time, formula (2.5) with f = Co and ф = dy 
will be used, while for the second time, formula (2.9) will be used with f = С and 
ф = фи. 

The critical point z*(u) of the function ¢, given by (2.22) is the solution to the 
equation à; [A9] (<) = и. It is not hard to see that z = z*(u) if and only if 
и = u*(z). It follows from (2.14) that 


z'(u) = -0NO (и), uel. (2.23) 


The next formulas can be derived using (2.17), (2.15), (2.22), and (2.23). We have 


& by(z*(u)) = AUG’ (2.24) 
" 09 A) (y) 
826. (2 (u)) — [PAO Gp (2.25) 
and 3 A (0) 2 2 A (0) 4 л (0) 
Әф, (с^ (и)) = асе OT (2.26) 


[PAM WP 
Let us define the intervals J„ appearing in Definition 2.3 as follows: 
Jn = {u € I : (и) € (—n,n)}, п> 1. 


It is not hard to see that condition (ii) in Definition 2.3 is satisfied. Next, using (2.5) 
and (2.21), we obtain 


n u Qu (z* (u)) 1 
ГА ехр Б fe(z)dz = exp | € | 02.6(z*(u)) 


C U)  5(ABdu(z*(u))) Co(z* Q0) 
C * С * 2 2 
| o(z (u)) + ( 1( (и)) Р 202ф,(с*(и)) 24(02 pu (z*(u))) 
4 * * 3 * * 
д; Фи(с*(и))Со(с*(и)) O; Pu (z* (u))0:Co(z =" 3c, (e) | (2.27) 


8(026, (z*(u))) 2(д2ф„ (2*(и)))? 
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as є — 0. Note that the differentiability conditions in Theorem 2.4 allow us to use 
formulas (2.5) and (2.9) with the functions f and ¢ chosen above. 
We will next compare the formulas in (2.10) and (2.27). Note that 


Qu (z* (u)) = u z* (u) — z* (u) u* (z* u)) — AO (u*(z*(u))) = — A9 (u). 


This shows that if we choose the function d as in (2.16), then the first factors in 
formulas (2.10) and (2.27) coincide. Moreover, the functions Со and C, have to be 


chosen so that 
Co(z*(u)) = J 02 A) (и) exp( AU) (u)) (2.28) 


and 


8iCo(z*Q)) — 5(д2ф„(*(и)))” Co(z* Q0) 
202 0, (z* (u)) 24(д2д„ (z* (u))) 
Piu C U Coz*G)) | 86v GW) 0; Coz* G0) 

8(026, (z* ()))? 2(02ф,(2*(и))) ——- 


Ci" (u)) = Co(z*(u)) AP (и) 


(2.29) 


The representations of the functions Co and C, given in Theorem 2.4 can be 
obtained by plugging и = u*(z) into (2.28) and (2.29), and simplifying the resulting 
formulas. Equalities (2.23)-(2.26) are taken into account in the simplifications. 

This completes the proof of Theorem 2.4. 


Remark 2.5 We have already established that [A9] (y) = 0 for all y є К. Since 
(2.14) and (2.15) hold, we have 


д [ay (y) = —u*(y) 


for all y є К. Hence the infimum of the function [А @]* on the real line is attained 
at the point y such that и*(у) = 0. This point is given by y = z*(0) = OA (0). 
Moreover, 

* 
inf [A] o» = -A9'() = o. 


ye 


Remark 2.6 А heuristic conclusion that can be reached using Theorem 2.4 is that the 
family f is a small-time approximation to the family p in a certain very weak sense. 
Finding such approximations is an important problem. We consider our results as 
first modest steps in going beyond the celebrated Gartner-Ellis theorem. 


The next assertion provides a first order large deviation estimate in the Gärtner- 
Ellis theorem for families of measures satisfying condition (2.10). Higher order 
estimates can also be found, but we do not include them in the present paper. Let A 
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be a bounded Borel set. Denote by A the closure of the set A, and let 


а? =sup{z} and а = inf(z). 
zeEA 


zeA 


Then we have zt, z7 € A. 


Theorem 2.7 Let p be a family of probability Borel measures on R such that (2.10) 
holds. Suppose also that the function A is twice continuously differentiable on I 
and the conditions in the Gdrtner-Ellis theorem hold (see the conditions listed after 
formula (2.13)). Suppose also that A C R is a bounded Borel set, and x € A. Then 
the following are true: 


(i) If x > OA (0), then as є — 0, 


[А @] (х) – u*(x) (at — x) 


€ 


р‹(А) < af les {Auta} 
х (1 + «A (u*(x)) + ov). Q.30) 


(ii) If x < ON (0), then as e — 0, 


[AOT (х) – |и*(х)| @ — a7) 


€ 


pe(4) < a | exp {A * x} 


x (: + cAO (u*(x)) + ov). (2.31) 


The big O estimates in (2.30) and (2.31) are uniform with respect to x € A. 


Remark 2.8 The conditions x > 2Л((0) and x < ðA (0) are equivalent to 
u* (x) > 0 and u*(x) < 0, respectively. 


Theorem 2.9 Let p be a family of probability Borel measures on R such that (2.10) 
holds. Suppose also that the function A® is twice continuously differentiable on I 
and the conditions in the Gdrtner-Ellis theorem hold (see the conditions listed after 
formula (2.13)). Suppose also that A C R is a bounded open set, and x € A. Then 
the following are true: 


(i) Let x > ЛО (0). Then there exists a constant ^A > О depending on the set 
A such that as є — 0, 
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oy А o 
paza] Е | ap avaro] 
x (1 — exp Б) ( + AP (u* (0) + ov). Q.32) 


(ii) If x < OA (0), then as e — 0, 


(0)]* * + 
pA) > e| See ҮШҮ 
х (1 — exp Б) ( + eAO (u(x) + ov). (2.33) 


The constant үд in (2.33) is the same as in (2.32), and the big О estimates in (2.32) 
and (2.33) are uniform with respect to x € A. 


Remark 2.10 Note that performing the transformation lim sup,_, 9 elog p«(A) in the 
upper estimates in Theorem 2.7, we obtain the upper estimate in the large deviation 
principle for any bounded Borel set A. This gives a little more than the upper estimate 
in the Gartner-Ellis theorem. However, we should not forget that formula (2.30) was 
derived under a stronger restriction (2.10), than in the Gártner-Ellis theorem. 


Proof of Theorem 2.7 We borrow some ideas from the proofs of Cramer's theorem 
and the Gartner-Ellis theorem given in [4]. The proofs of the upper estimates in those 
theorems use Chebyshev's inequality. In our case, due to a special structure of the 
problem, we can provide a slightly more direct proof. 

Suppose the conditions in Theorem 2.7 hold, and let u є J and e > 0. Then we 
have 


uz { uz 
/ ехр [-5] pe«(dz) > p-(A) inf [exp -S ; (2.34) 
A € zeA € 
It follows from (2.34) that for every и є J there exists £(u) € А such that 


p (A) sep [ “| f ee аә 
A € 


€ 


(0) 
та AO} Гар {-=<} D<(dz) 
€ A € 


| A (и) + xu + u(£(u) — х) | 
х ехр ; 


€ 


Indeed, we can take (и) = at if u > 0 and £(u) = a7 ifu < 0. 
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Next, by plugging и = u*(x) into the previous equalities and taking into account 
condition (2.10), we get 


(0) p, * 
p«(A) exp AO) / exp |[- 78] p. (dz) 
A 


€ 


€ 


| [AO] (x) — u* (x)(£(u* (x)) — x) | 
х ехр 


(0)]* руж ж _ 
О [A] ө) EUM (х)) "| 


х (: + eA (u* (x)) + ov) (2.35) 


as є — 0. Now, it is not hard to see that (2.35) implies Theorem 2.7. 


Proof of Theorem 2.9 The lower bounds given in Theorem 2.9 are more delicate. 
Here we start with the estimate 


/ е Б Pe(dz) € ре(А) sup [exp I 


zeA 


instead of the estimate in (2.34). This implies that 


pay = exp [ 192] f exp [- 5] pte 


(0) 
ЕСЕ pe«(dz) 
€ A € 


| AO (и) + xu + unu) — 2 
x exp , 


€ 


for all и € I, where т(и) = a^ if и > 0 and (и) = ат if u < 0. Therefore 


(0) (, * 
pa) sas | ACU) [o |- 28] „аә 
A 


€ 


| [OT (х) — u* (x) (ņn(u* (x)) — x) 
x exp 


€ 


| ' (2.36) 


Our next goal is to use the change of measure method. Consider a new family p 
of probability measures defined by 


The Gártner-Ellis Theorem, Homogenization, and Affine Processes 299 


exp f- 7 "шу | ра) 


Jig exp [- 85] pe(dz)_ 


Pe(dz) = € » 0. 


Note that the family p depends on x. Then inequality (2.36) and condition (2.10) 
imply that 


(0) p, * 
p«(A) БЕСЕ pe(dz) p«(A) 


€ 


| [AO (x) — и*(х) (п(и*(х)) — 2 
x exp 


€ 


—exp [A906] (1 + eA D (u*(x)) + ov) De(A) 


ACQ * ED * = 
ы =] [AO T (x) — wwe) У) с 
є 
as є 0. 
We will next estimate the quantity 
ре(А) = 1 — ре(А) (2.38) 


from below. This will be done using the upper estimate in the Gartner-Ellis theorem. 
Let us denote by A) the function defined by (2.11) for the family p instead of the 
family p. Then it is not hard to see that 

A (y) = AO (y c u*(x)) - AD (u*(x)), vel, (2.39) 


where T = [= u* (x). The function AO and the interval T depend on x. It is clear 
that 0 € J. Moreover, 


Е 


Next, taking into account that А“ is a closed set, and using the upper large deviations 
estimate in the Gártner-Ellis theorem (see Theorem 2.3.6 in [4]), we obtain 


= * 
lim sup [e log p.(A9] < — inf [a0] (y). 
«>0 yeA* 


Set 64 = infycac ES (y). Using Remark 2.5 and (2.39), we see that the 
unique infimum of the function [A] on the real line is attained at the point 


y= [AO] @ = A9 WO) = x, 
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and is equal to zero. Since x ¢ A^, and the set A^ is closed, we have 64 > 0. 
Therefore, for every т > 0, there exists єг > 0 such that 


—6 
p.(A°) < exp |+] ‚ б<є<е. (2.40) 


Fix any number 7 > 0 with 0 < т < бд, and set уд = дд — т. Then (2.38) and 
(2.40) imply the following estimate: 


Bata) = 1 exp |=], О <є< e. (2.41) 
€ 


Finally, using (2.37) and (2.41), we establish estimate (2.32). 
The proof of Theorem 2.7 is thus completed. 


3 Affine Processes 


Let D be a non-empty Borel subset of the real Euclidian space R^, equipped with 
the Borel c-algebra D, and assume that the affine hull of D is the full space R^. To 
D we add a point 6 that serves as a ‘cemetery state’. Define 


D=DU{d5}, D=a(D,{5}), 


and equip D with the Alexandrov topology, in which any open set with a compact 
complement in D is declared an open neighborhood of 6.! Any continuous function 
f defined on D is extended to D by setting f (ô) = 0. 

Let (©, F, F) be a filtered measurable space, on which a family (D*),.5 of 
probability measures is defined, and assume that F is P*-complete for all x є D 
and that the filtration F is right continuous. Finally, let X be a càdlàg process taking 
values in D, whose transition kernel 


p(x, A) -P(X,€A), (t>0,xe€D, AED) 


is a normal time-homogeneous Markov kernel, for which 6 is absorbing. That is, 
Di x, .) satisfies the following conditions: 


(a) x > pi(x, A) is D-measurable for each (t, A) є Куох D. 
(b) po(x, {x} = 1 forall x € D, 

(с) p; (0, {o}) = 1 forallt > 0 

(d) р(х, D) = 1 for all (t, x) € Ryo x D, and 

(e) the Chapman-Kolmogorov equation 


! Note that the topology of enters our assumptions in a subtle way: We require later that X is 
càdlàg on D, which is a property for which the topology matters. 
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Pr+s(x,d€) = [ ноа Ps (x, dy) 


holds for each t, > 0 and (x, d£) є DxD. 


We equip R? with the canonical inner product (, }, and associate to D the set 
U с СЯ defined by 


U= [u € С“: sup Re (u, x) <o}. 
xeD 


Note that the set Lf is the set of complex vectors и such that the exponential function 
x к> е) is bounded on D. It is easy to see that U is a convex cone and always 
contains the set of purely imaginary vectors iR. 


Definition 3.1 (Affine processes) A stochastic process X is called affine with state 
space D, if the transition kernel p; (х, d£) of X satisfies the following conditions: 


(1) It is stochastically continuous, i.e., lims_.; ps (x,.) = pi;(x,.) weakly for all 
t z0,xe€ D. 

(i) The Fourier-Laplace transform of the kernel depends on the initial state in the 
following way: there exist functions  : £59 xU > Candy: Rzox& > Cc, 
such that 


/, el") р,(х, d£) = O(t, и) ехр((х, W(t, и))) (3.1) 


for all є Ко, x € D, and u є U. 


Remark 3.2 Note that the previous definition does not specify w(t, и) in a unique 
way. However, there is a natural unique choice for v that will be discussed in 
Proposition 3.3. Also note that as long as ®(t, и) is non-zero, there exists Ф(Т, и) 
such that Ф(т, и) = е0, and equality (3.1) becomes 


|, e €" p (x, d£) = exp (ó(t, u) + (x, Y(t, и))}. (32) 


This is the essentially the definition that was used in [5]. Condition (3.2) means 
that the Fourier-Laplace transform of the transition function is the exponential of an 
affine function of x. This fact is usually interpreted as the reason for the name ‘affine 
process’, even though affine functions also appear in other aspects of affine processes, 
e.g., in the coefficients of the infinitesimal generator, or in the differentiated semi- 
martingale characteristics. We prefer to use equality (3.1) instead of equality (3.2), 
since the former equality leads to a slightly more general definition that avoids the 
necessity of the a-priori assumption that the left hand side of (3.1) is non-zero for 
all t and u. 


Before we start exploring the first simple consequences of Definition 3.1, addi- 
tional notation will be introduced. For any u € U, seto) := inf {t > 0: e(t, и) = 0} 
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and О := {(t, u)elRzoxü:t« a(u)}, and let ф be a function on О such that 
D(t, u) =e? forall (t,u) e О. 

The uniqueness of @ will be discussed below. The functions ф and w have the fol- 

lowing properties (see [17]): 


Proposition 3.3 Let X be an affine process on D. Then 


(i) The condition a (u) > 0 holds for any и € U. 
(ii) The functions ф and w are uniquely defined on Q under the restriction that they 
are jointly continuous and satisfy $(0, 0) = v0, 0) = 0. 
(iii) The function y maps Q into U. 
(iv) The functions ф and 1b satisfy the semi-flow property. For any и € И апа 
t, s > 0 witht + s < o(u), the following conditions hold: 


Ot +5,u) = Ot, и) + Os, Y(t, и), Ф(0, и) =0 
VG t s, u) = Yt, Ys, u)), YO, и) =u 


Remark 3.4 In the sequel, the functions Фф and w will always be chosen according to 
Proposition 3.3. 


We now introduce the important notion of regularity. 


Definition 3.5 An affine process X is called regular if the derivatives 


OQ(t, и) _ A(t, и) 
дї b ше дї 


ї=0+ 


F(u) = 


t=0+ 
exist for all и € U and are continuous at и = 0. 


The next statement illustrates why the regularity is a crucial property. This 
statement was originally established by [5] for affine processes on the state-space 
R” x К^. 


Proposition 3.6 Let X be a regular affine process. Then there exist R4-vectors 
b, gt. — 64; d x d-matrices a, al, ..., ad ; real numbers с, yl, — yf, and signed 
Borel measures m, ut; Mong ud on В V {0} such that the functions F(u) and R(u) 
can be represented as follows: 


pipe sau) + (bu) - e | 


(£u) т. 
; ыш е 717 000) mao. 


(3.3a) 
Riu) = и, fu) (8и) - 9 Ed (e^ =й ШТ), и! (d£) . 
(3.3b) 
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In the previous formulas, h(x) = xljxja1) is a truncation function. In addition, for 
all x € D, the quantities 


A(x) eg dada q^ xg, (3.4a) 
B(x) =b + x18! +--+ + хаб“, (3.4b) 
C(x) = с+ х7! а, (3.4с) 
v(x, d£) = m(d&) + xin! (dE) +++» + xau" (d£) (3.4d) 


have the following properties: A(x) is positive semidefinite, C(x) < 0, and 


| (пе А г), d£) « оо. 


Moreover, for u € И and є [0, c (u)), the functions ф and 1b satisfy the following 
ordinary differential equations: 


д 
5, 7 и) = Е((ї, и)), Ф(0, и) = 0 (3.5а) 
д 
a и) = R(j(t,u), 00, и) =u. (3.5b) 


Remark 3.7 The Eq. (3.5) are called generalized Riccati equations, since they are 
classical Riccati equations when m(d£) = Ш (d£) = 0. Moreover, Eqs. (3.3) and (3.4) 
imply that u œ> F(u) + (R(u), x) is a function of Lévy-Khintchine form for each 
x € D. 


Proof See [17]. 


In general, the parameters (a, al, b, Bi, C, y, m, D ien.) appearing in the rep- 
resentations of F and R in (3.5a) and (3.5b) have to satisfy additional conditions, 
called the admissibility conditions. These conditions guarantee the existence of an 
affine Markov process X with state space D and with prescribed F and R. It is clear 
that such conditions should depend strongly on the geometry of the (boundary of the) 
state space D. Finding such (necessary and sufficient) conditions on the parameters 
for different types of state spaces has been the focus of several publications. For 
D = К, x R”, the admissibility conditions were derived in [5]. For the cone of 


semi-definite matrices D — St. such conditions were found in [2], and for sym- 
metric irreducible cones, the admissibility conditions were found in [3]. Finally, for 
affine diffusions (m = и = 0) on polyhedral cones and on quadratic state spaces, 
the admissibility conditions were given in [20]. 


Definition 3.8 We call the state space D = Ro x R” with m,n > 0 the canonical 
state space. 


Affine processes on canonical state spaces are completely characterized in [5] 
in terms of the admissibility conditions imposed on F and R. Affine processes 
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on canonical state spaces have continuous trajectories (such processes are called 
continuous affine processes) if and only if the functions F and R satisfy the admis- 
sibility conditions and are polynomials of degree at most 2 (see Proposition 3.6). 


4 Homogenization Procedure 


In this section, we consider continuous, affine processes on the canonical state space 
D= К x R”. We will next introduce a natural homogenization procedure, which 
allows to analyze the short-time asymptotics ofthe law of continuous affine processes. 
In the case of affine processes, the homogenization leads in fact to real analytic 
expansions with respect to the homogenization parameter. 

The following lemmas introduce the homogenization procedure. 


Lemma 4.1 Let: x Ко — U be the unique solution of the equation 


д 
gL = RUU D), pu, 0) =u ed, 
where R : U — C4 is a quadratic polynomial. Then, for every є > 0, the function 
v (u, t) :— (=, а) 
є 
solves the equation 
д € [3 € Є 
ar (и, t) = R (4 (и, 1)), y (u,0) =u 


with R‘ (u) := eR(e!u) for u € ut. 
Analogously, let à : U x Ко — С be the unique solution of the equation 


Эи, t) = Е(0(и, t), dw, 0) = 0. 
Then, for every є > 0, the function 
шлу = (5, er) 
solves the equation 
8 
ГДС t) = F (Y (u, 0), (и, 0) = 0 


with Еи) := eF (elu) for u Єй. 


The proof of Lemma 4.1 is simple, and we leave it as an exercise for the reader. 
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Lemma 4.2 Under the previous assumptions, the limit їтїр € = Y® exists 
uniformly on compact sets in И х Rso. Furthermore, 


феи, t) = VO u, t) e (u, 1) + P Qu, 1) (4.1) 


п>2 


is a convergent power series expansion for small є > 0. The coefficient functions in 
(4.1) satisfy certain ordinary differential equations, i.e., in particular, 


2 pou, t) = RO (YOu, 0), YOu, =u, 


and 


д д 
a) = | e (uj) (1) a) = 
— u,t) = — R u,t u,t), u,0) = 0. 
a í ) S us (Yh (и, г) (и, t), vu, 0) 
Forn > 2, the equations for the coefficient functions involve higher order derivatives. 
In complete analogy, the limit lime.o 9€ = ¢© exists uniformly on compact sets in 
U x Ко. Furthermore 


d Qu, п) = 9 (и, п) + eg (и, 1) + У eo Qu 1), 


n>2 
for small enough values of e. 


Proof Observe that R^ = R + eR + S RO and Fo = FO + FY + SF, 
Hence, the vector fields appearing in the equation in Lemma 4.2 are polynomial in u 
and e. Standard results on differential equations with polynomial vector fields yield 
the assertions in Lemma 4.2, in particular, the real analyticity of the solution with 
respect to e. 


Let X be an affine diffusion process with the corresponding functions F and R. 
We can extend the solutions of the Riccati equations described above to maximal 
domains for и є R? ,ie., consider maximal local flows оп IR with the vector fields 
F* апа А“. By A, i > 0, are denoted the functions appearing in the following 
power series expansion in c: 


AO (и) + AMC) +... = фи, 1) + (x, 0-и, D), (4.2) 


They are the solutions of the Riccati equations appearing in the previous lemmas. 
Note that we suppress the dependence on the initial value x on the left-hand side of 
(4.2). The functions A“ exist as extended real numbers for u € R4. 


Remark 4.3 If the expression on right-hand side of (4.2) is finite, then the power 
series on the left-hand side converges absolutely for sufficiently small values of є. 
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Remark 4.4 For continuous affine processes, the homogenization procedure leads 
to the following representation: 


E [exp [^ xo}| = [ow {-(=, 2a} pe(dz) 


i (0) н 
sp [47 eis... (4.3) 


where u is such that the expressions on both sides of (4.3) are finite for small enough 
values of e. 


The representation in (4.3) valid for any continuous affine process was a motivation 
for us for introducing condition (2.10) used in the previous sections. However, the 
expansion in (4.3) is a little different from that in (2.10). 


5 Example: The Heston Model 


In this section, we find explicit formulas for the functions A, 0 <i < 2, associ- 
ated with the log-price process in the Heston model. Let us consider the following 
correlated Heston model: 


dX, = (т + kV)dt + ууа, 
dV, = (a — bV,)dt + a/ V;d 1, (5.1) 


where ғ, k € К, а, b > 0, с > 0, and №; and №; are standard Brownian 
motions with d(Wi, W2); = pdt. We assume that the correlation coefficient р sat- 
isfies the condition —1 < р < 1. In (5.1), X is the log-price process, and V is the 
variance process. The initial conditions for the processes X and V are denoted by 
xo and vo, respectively. The Heston model was introduced in [14]. Note that in the 
present paper we consider the Heston model in which both the log-price and the vari- 
ance equations contain drift terms generated by affine functions. Very often, e.g., in 
[7-10, 16], a special Heston model where k — -i andr — Ois studied. An extended 
Heston model, in which the defining equations contain affine drift terms, is discussed 
in [15]. 

The process X is not an affine process. It is a projection of the two-dimensional 
affine process (X, V) onto the first coordinate. The moment generating function of 
X; is given Бу M;(u) = E [exptu X] = exp {C (u, t) + D(u, t)vo + ихо}, where 


— d(u)t 
Cust) = ru + SS 6 pou + 4ш) 210g (~ gwe )| 
oO 


1— glu) 


The Gártner-Ellis Theorem, Homogenization, and Affine Processes 307 


b + d(u) — pou 1 — е0)? 
Diu, t) == o2 (, — gued 5 
b — pau + d(u) 
glu) = ———_—__ 
b — pau — d (u) 


and 


d(u) — V (оси — by? — o? (2ku + и?) 


(see [1]). Here and in the sequel, the symbol 4/- stands for the principal square root 
function. We will explain below the meaning of the logarithmic function appearing 
in the expression for the function C (see the discussion after formula (5.7)). Note 
that for и = 0, the expressions for the functions C and D should be understood in 
the limiting sense. More precisely, 


С(0, 1) = lim C(u,t) = 0 and р(0, г) = lim D(u,t) =0 
u—0 u—0 


for all > 0. 
It is clear that 


у leet- 7x4] = exp lec. t) + D, t)vo — =] А 


Denote Л (и, t) = t log E [exp( - 2 Х,}]. Then 
u u 
A(u, t) — Ата) ери, (5.2) 


Next, set А (и) = b — pou. It is not hard to see that 


= edt 


— ADEI) „а(и)г 
A(u)—d(u) 


: d(u)t 
1 sinh 5- 


= —(A(u)* — а(и)?) | 
а(и) cosh “4 + A (u) sinh 44 


1 
D(u, t) = za (AW) +d(u)) 


g2 


Moreover, 


; 886" 
u)—atu 
CQ, r) = rut + S| аш) + 4ш) 2108 ( Г Апа) ) 
A(u)—d(u) 
d(u) cosh 4! + A (u) sinh 44 
and = epson 2 ш) 2| 
о? d(u) 
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Using the previous formula, we obtain 


c(- 5.) =-ги 


d(—")t 


d(—* r 
a d(— 7)t cosh —5—— + (bt + pau) sinh — 
+ Sfor + pou 2log 49 . (5.3) 
We also have i 
A(- 7) —b-4po-, 
2 и 2 и 2 и? 
А (-=) =» + 2bpo-— + р0< —, 
Í t t 
é( J _ u’ (1 — P)? à 2си(ко + bp) +12 
t t2 t | 
1 A( ) é( J _ u?  2ku 
с? tJ) £ t’ 
and 
pl) = (ke sinh С; T 
(=) = (2 x) d(—")t (d) 


d (—#) cosh ( E) +A (—#) sinh a 
Let us denote by Z the set of such real numbers u that the expressions on the 


right-hand side of (5.3) and (5.4) are finite for all small enough values of г, and put 


ŝu, t) = d(- JE 


It is easy to see that 


a 1 
Stu, t) = 5v-#d — p*)o? + 2tu(ko? + bpo) + t2b2. 


In the previous formula, t is a real number. Therefore, for every real number и Æ 0, 
S(u, t) is purely imaginary for all numbers г with |¢| small enough. For such и and 
t, S(u, t) — iS(u, t), where 


1 
S(u,t) = „уна p?)c? — 2tu (ko? + bpo) — t?b? (5.5) 
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is a real number. It follows that 


ies ) = —іғи 


2 t b t 
qun зеш Е S(u, t) cos S(u, t) + (bt + pou) sin S(u, t) (5.6) 
c 2S(u, t) 
and 
1 [4 
1D(- A ) = (u? эши) ш ‚ (5Л) 
t 2S(u, t) cos S(u, t) + (bt + pau) sin S(u, t) 


Our next goal is to introduce an additional condition under which the logarithmic 
function appearing in formula (5.6) exists, and the expressions on the right-hand 
sides of (5.6) and (5.7) are finite. Recall that we have assumed that u 4 0 and |ї| is 
small enough. Set 


S(u) = lim [25(u, г) cos S(u, г) + (bt + pou) sin S(u, r)]. 


Then, we have 


1 
lim 5(и, ї) = 5 lov 1-p? 
t> 


and 


S(u) = 


+ ucpsin 


ioe hl 
lu|o v 1 p соз EN P j ED 


Let p 5 0, and assume that 


J1— 2 2 ( =) 
т — arctan . (5.8) 
p 


— ———————— arctan <и < 


сү 1-— p p сү1— р? 


The restriction in (5.8) means that the variable и is bounded from below by the largest 
negative root of the function 


S(u) — =у/1—@?° Гаа Ааай 


апа from above by the smallest positive root of the same function. Note that 500) > 0. 
Therefore, we have S(u) > 0, for all u satisfying the condition in (5.8). 

It is easy to see that 5(и) = o|u|S(u) for all и з 0, satisfying the condition in 
(5.8). Hence, S(u) 7 0, under the same restrictions on u. It follows from (5.7) that 
for all и Æ 0 such that (5.8) holds, the right-hand side of (5.7) is eventually finite as 
t — 0, and moreover 
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u и sin “Yo Er 
lim 1D( DE 2 . (59) 


«(i-o cos * Y5 Бата 


In addition, ће expression under the logarithm sign in (5.6) is eventually positive, 
and 


lim ic( = =, t) =0. (5.10) 


1—0 


In the case where р = 0, the condition in (5.8) becomes 
T T 
= – <и < –. (5.11) 
c т 


The analysis here proceeds similarly to that in the previous case. 

The next statement provides explicit expressions for the function Л. This state- 
ment was obtained in [8] (see formula (2) in [8], see also [10]) in a special case where 
k = —$ andr = 0. 


Theorem 5.1 Suppose p +Æ 0 and condition (5.8) holds. Then и € Z and the 
following formula is valid: 


; № 1—02 

vou sin “YP 7 B 
BENE) исму 1—р? . uar/1—p? 
(1 p^ cos — 5 — + psin —5—— 


If p = 0 and condition (5.11) holds, then u € Z and 


л (и) = xou. (5.12) 


uc 
AO (u) = zo i an — — xou. 
т 2 


Theorem 5.1 follows from (5.2), (5.9), and (5.10). 
Recall that for x € R, the critical point u*(x) is the solution of the equation 


ð AO (и) = —x. Put 0 = 2; = zd . Then, using (5.12), we obtain 


vo p[1 — cos(20u)] + 4/1 — p? sin20u) + o(1 — p*)u 
(/1 — p? cos(8u) + рѕіп(ди))2 


In a special case where p — 0, we have 


д, A (и) = XQ. 


vo sin(20u) + cu 
c 1+ соѕ(20и) 


ð AO (и) = XQ. 
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In the following two statements, we provide formulas for the critical point u* (x) 
and the second derivative of the function AO (u). These results can be used in the 
asymptotic formulas established in the previous sections in the case of the Heston 
model. 


Lemma 5.2 Suppose p 4 0 and condition (5.8) holds. Then, for every x € R, the 
critical point u* (x) is the unique solution to the equation 
p[1 — cos(20u)] + /1 — р2 sin(20u) + о(1 — pu _ 20 
(/1— p? соѕ(ди) + psin(0u))? vo 


If p = 0 and condition (5.11) holds, then for every x є R, u*(x) is the unique 
solution to the equation 


sin(26u)+ou c 
1--cos(20u) ^ vo 


Lemma 5.3 Suppose p 4 0 and condition (5.8) holds. Then 


vo S (и) 


A = 
Iu 2o[/ 1 — p? cos(0u) + рѕіп(ди)]? 


where 


S(u) = Q0 + o 1— р?)[оу/1 — p? ѕіп(ви) + (1 — р”) cos(0u)] 
+ 200(1 — pulv 1 — p ѕіп(ди) — pcos(8u)]. 


If p — 0 and condition (5.11) holds, then 


о A + о) соѕ(ди) + 20cu ѕіп(ди) 


2 (0) 
EE cos? (ди) 


Lemmas 5.2 and 5.3 are straightforward, and their proofs are omitted. 
We will next compute the functions A? and A). Recall that 


A 


fo ( — =z) р (dz) = exp ( E exp (AO (u)) (: +A (и) +.. ). 


Therefore, 


Alu, t) = AO (u) + tA (u) + tlogd t (AO W +...). 
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By differentiating the previous formula with respect to t, we obtain 
A® (u) = lim p t) 
120 Ot” 


and 


2 
А (и) = : - (u, t). (5.13) 


L lim —— 
2 ү д 
Let us fix и Æ 0 such as in Theorem 5.1. Then the function t œ> 5 (и, t) defined 


by (5.5) is real analytic in t in a small neighborhood of т = 0, depending on и. Using 
the Taylor formula, we obtain 


Stu. E esepüd Are joo +0 (°) (5.14) 


as t — 0, where the O-estimate depends on и, and the coefficients are given by 


co(u) = we vi — p, (5.15) 


ju] ko + bp 


и 2/1 — pe’ 


cı (u) = (5.16) 


and 


|u| b2(1 — p?) + (Ко + bp)? 


077 20(1 = p2)? 


и 


Our next goal is to expand the functions f к> sin 5 (и, t) and t њ» cos S(u, t). 
Using the Taylor formula and (5.14), we get 


sin S(u, t) = Оо(и) + 0 (и) + 00? + о@?) (5.17) 
as f — 0, where 
Ug(u) = sin co(u), (5.18) 
(и) = cı (и) cos co(u), (5.19) 
and 
U»(u) = c(u) cos со(и) — cı (u)? sin co(u). 
Similarly, 


cos S(u, t) = Wo(u) + Wi (и) + Emar + O(t) (5.20) 
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as f — 0, where 
Wo(u) — cos co(u), 


Үү (и) = —c; (u) sin co(u), 


and 
М (и) = —[c2 (u) sin co(u) + c1(u? cos co(u)]. 


We will next expand the functions t +> t D(—7, t) and t +> tC(—7. t). It follows 
from (5.14), (5.17), and (5.20) that 


2S(u, t) cos S(u, t) + (bt + pou) sin S(u, t) = Volu) + Vi (u)t + ш? +о@?) 
(5.21) 


as f — 0, where 
Vo(u) = 2со(и) Wo(u) + peuUo(u), 
Vi(u) = 2со(и) Wi (u) + 2с (и) Wo(u) + bUo(u) + pouU: (и), 
and 
Vo(u) = 2co(u) (и) + 4с (и) Wi (и) + 2c2(u) Wo(u) + 2bUi(u) + рси0 (и). 


It is not hard to see that 


Vo(u) = 2co(u) cos co(u) + pou sin co(u), (5.22) 


Vi(u) = (2 + pou)ci(u) cos co(u) 


+ (b — 2co(u)ci(u)) sin co(u), (5.23) 
and 
Ү(и) = [2c2(u) + 2рс (и) + pouc: (u) — 2со(и)сі (u)?] cos со(и) 
— [2со(и)сә(и) + 4с1 (и)? + pouci (u)?] sin со(и). 
Therefore, 


Uo(u) + Uj (u)t + 5U2(u)t? + 0 (0) 
Vo(u) + Vi(u)t + 4 V2(w)t? + O(t) 


DES, t) = (u? — 2tku) (5.24) 


as t — 0 (see (5.7), (5.17), and (5.21)). 
Set 


D, t) = To(u) + Ti (u)t + srw? + O°) (5.25) 
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as t — 0. Then, (5.24) and (5.25) give 
Tou) Volu) = и?0о(и), 
Tou) Vi (и) + Tı (u) Vo(u) = u?Ui(u) — 2kuUo(u), 

and 

1 1 1, 

5 Tou) Va(u) + Tı (и) И (и) + 5 Fa(u) Volu) = U»(u) — 2kuU|(u). 
It follows from the previous equalities that 


u? Ug(u) 


To(u) — Youu)” 


u? Ui (u) Vo(u) — 2kuUo(u) Vo(u) — u?Uo(u) Vi (и) 
Ti(u) — ; 


Vo(u)? 
and 
Tau) = an 
where 


О(и) =u’ Uz (u) Volu)? — AkuUi (u) Volu)? — u?Uo(u) Vo(u) (и) 


— 2u? Ui (u) Vo(u) Vi (и) + 4ku Uo (u) Vo (u) Vi (и) + 2u? Uo (u) Vi (u?.. 
(5.26) 


Therefore, the following asymptotic formula holds: 


DEO”, js u^ Uo(u) т Гал (u) Vo(u) — 2kuUo(u) Vo(u) — u?Uo(u) Vi (и) 
t Vo(u) Vo(u)? 
2 Q(u) 3 
Vow? + O(t^) (5.27) 
ast — 0. 


Now, we turn our attention to the function t — tC [en 1). Using (5.6), we see 
that 


Volu) + И (и)г + Or?) | 
2co(u) + 2с1(и)г + О (t) |` 
(5.28) 


tC( Sps tru +15] 0+ pou 2 log 
t (од 
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Set 
Volu) + Vi(u)t + O (2) 


== 2 
2со(и) + 2c1(u)t + О (2) = Lo(u) + L1(u)t + O(t^) 


as t — 0. It is not hard to see that 


_ Volu) 
Го(и) d 2со(и) , (5.29) 
co(u) Vi(u) — eio) Vou) 
Li(u) = Dep (uy? (5.30) 
We also have 
2 L\(u) 2 
log[Lo(u) + Li (и) + O(t^)] = log Lo(u) + т P y + O(t^) (5.31) 


as t — 0. It follows from (5.28)-(5.31) that 


Ci ту = E ru = og 1090 | 
t т o4 2co(u) 
di ab 2a co(u)Vi(u) — cı (u) Vo(u) 
oc? о? co(u) Vo(u) 


| 7+ о@?) (5.32) 


as t — 0. 
Next, we will find explicit expressions for the functions Л( and AO. Suppose 
р 0, и Æ 0, and condition (5.8) holds. Then 


2a Vo(u) 
лои) = (2 — 7) І 
(и) ( с Ё о? "e 2со(и) 


QUI (и)Уо(и) — 2kuUo(u) Vo(u) — u?Uo(u)Vi (и) 
Үо(и)2 


(5.33) 


Formula (5.33) can be established, using (5.27) and (5.32). 
The next statement provides an explicit expression for the function A“ in terms 
of the Heston model parameters. 


Theorem 5.4 Suppose p Æ 0, и 4 0, and condition (5.8) holds. Then 


[S 


ov 1—р? : uoy 1—р 
2 


+ psin 7 
/1—? 
/т=27 A-5 [i= [i= 
E; (u) cos? oF Eg + Ез (и) cos == + Р sin “7 + P + E3(u) sin? wove + p 
vo 


log 


2 V1 — p? cos “ 
AY u) = (= r) u is p 
с 


(5.34) 
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where TERR. 
uc(ka 
AOE а. 
2 
k b 
Ex(u) = —2ko 1 — pl + 
= р 
апа " " 
Ез) = — (2kp0 + b+ ua = г). 


If p = 0, then formula (5.34) holds for all и satisfying condition (5.11). 


Proof Taking into account (5.33), (5.15), (5.16), (5.18), (5.19), (5.22), (5.23),we 
obtain 


М1—г° ; 
Weir + pou sin 


lulo/1 = p 
E 72 = fia Л 2 П 
Е (и) соѕ2 ааа ^ Pub Ез (и) cos шо. /1 g п lle БЕ E3(u) sin? р 


vo , 


ex mp 2 
u|c E cA/1-g 
( lo /1— p? m sin sez) 


Julor/1—p? 
2 


2 u|c4/ 1 — p% cos 
лои) = (®° r)a m lul /1— p? 
o oO 


where 3 


~ u 
E(u) = -zI (ka + bp), 


= ko + b o+b 
Ew =] poulu] P. —2klulov 1 — p? +0+ piu Su 


2/1— р? 2/1— р? 


and 


" k b 
Ез(и) = —и? [уо + р + ис gor 1 А 


2 


Next, replacing |u| by и in the previous formulas (it is not hard to see that this can 
be done) and making several cancellations, we obtain formula (5.34). 

This completes the proof of Theorem 5.4. 

Our final goal in the present section is to find an explicit formula for the function 
AO іп terms of the Heston model parameters. It follows from (5.2), (5.13), (5.27), 
and (5.32) that 


ab  2aco(u)Vi(u) — cı (u)Vo(u) || vo Q(u) 
2 


(2) — 
ше е ооо 2 Vow)?” 


(5.35) 
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where О (и) is given by (5.26). Now, it is clear how to obtain an explicit expression 
for the function A), expressed in terms of the Heston model parameters. It suffices 
to transform the formula in (5.35), using the explicit expressions for the functions 
cj, Ui, Vi with i = 0, 1, 2, and the function О. Let us also note that the value of the 
function on the right-hand of formula (5.35) does not change if we replace |u| by 
u. Taking into account what was said above, and making long but straightforward 


computations, we see that the following statement holds. 


Theorem 5.5 Suppose р Æ 0, и 4 0, and condition (5.8) holds. Then 


AO (u) Z ab a Io(u) 


2 3. 3 A3 — 
d eral Pn ту ТТ а ` P go psin алт й 


vo hu) hu) 


2 =p =p 
| [T= P cos IE + psin EE | 


3? 


where 


2 


1 = 
lo(u) = |-uoo — pl(ko + bp | cos TY 


d [2 + 2pko + uc (ko + bp)(1 — p in * 


I (u) = 


u [2a = p^) + (ko + bp | (ko + bp)? 2 ису = р? 
sin 


20 — p?) 1- 


2 


oVl—p 
2 


, 


(5.36) 


Ь2(1 — р?) + (ko 4-bp) | b(ko + bp) |. uo 1— p uc 1 — p? 
+ + sin 5 cos 2 " 


3 
a(l - p? vi- 


uc /l1— p? 


һо) =2 Е Q+ pow ko + Д " 


2/1— р? 


2 


82 [too + 6+ 2 


and 


uc (kao + а ee uc 1 — p? 


2 
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17 2 
BU) = — ис\/1 — P + b sin? OA + 


ko+bp , uaV/1—p? uc /1— p? 
Jic sin 5 cos 5 2 


If p = 0, then formula (5.36) holds for all u satisfying condition (5.11). 


Proof The second term on the right-hand side of (5.36) can be obtained from the 
corresponding term in (5.35) by taking into account (5.15), (5.16), (5.22), and (5.23). 
Next, using (5.26), we see that 


Qu) _ и? [U2 (u) Vo(u) — Оо(и) Vo (u)] 
Volu)? — Vo(uy? 
à [4ku Vou) + 2и? Vy (и) | [Uo (u) Vi (и) — U1 (и) Vo(u)] 


Vo(u)? 


(5.37) 


Moreover 


U»(u) Vo(u) — Uo(u) Vo(u) = 2со(и)со (и) + 4c1(u? sin? co(u) 
— 2 [с2(и) + bci (u)] sin со(и) cos со(и), 


4ku Vo(u) + 2и? V (и) = 2и [4kco(u) + u(2 + pou)ci(u)] cos co(u) 
dj [2kpo + b — 2co(u)ci(u)] sin co(u), 


and 
Uo(u) Vi (и) — Ui (u) Vo(u) = b sin? со(и) — 2co(u)ci(u) 
+ 2c1(u) sin со(и) cos со(и). 
Set 
П (и) = U»(u)Vo(u) — Uo(u) Vo Qu), 
Ь(и) = u~? [7 Vou) + 2u? Vj (0| 
and 


I3(u) = Uo(u)Vi(u) — U1 (u) Vou). 
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Next, taking into account (5.35), (5.37), and using the explicit expressions for the 
functions cj, U;, and Vi with i = 0, 1, 2, which were found above, we obtain (5.36). 
This completes the proof of Theorem 5.5. 


Remark 5.6 The present remark concerns the continuity of the functions A with 
i = 0, 1, 2 on their domain. Recall that AY (0) = 0. It follows from Theorems 5.1, 
5.4, and 5.5 that the functions A are continuous on their domain with a possible 
exception of the point и = 0. However, it is not hard to see, using the explicit 
expressions for the functions A“, provided in the theorems mentioned above, that 


lim AO (и) 20 for i = 0,1,2. 


u>0 
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Asymptotics for d -Dimensional Lévy-Type 
Processes 


Matthew Lorig, Stefano Pagliarani and Andrea Pascucci 


Abstract We consider a general d-dimensional Lévy-type process with killing. 
Combining the classical Dyson series approach with a novel polynomial expansion 
of the generator A(t) of the Lévy-type process, we derive a family of asymptotic 
approximations for transition densities and European-style options prices. Exam- 
ples of stochastic volatility models with jumps are provided in order to illustrate the 
numerical accuracy of our approach. The methods described in this paper extend the 
results from Corielli et al. (SIAM J Financ Math 1:833-867, 2010, [4]), Pagliarani 
and Pascucci (Int. J. Theor. Appl. Financ. 16(8):1—35, 2013, [20]) to Lorig et al. 
(Analytical expansions for parabolic equations, 2013, [13]) for Markov diffusions to 
Markov processes with jumps. 


Keywords Multi-dimensional Lévy-type process with killing * Asymptotic 
approximation * Integro-differential equation * Levy processes · Parametrix 


1 Introduction 


In a multi-dimensional Markovian setting, the time evolution of a market model is 
usually described by the solution X of a Lévy-Itó stochastic differential equation 
(SDE). Such a model allows for features commonly seen in markets, such as sto- 
chastic volatility, jumps, default, co-integration and correlation. Many quantities of 
interest (e.g., option prices, net present values) can be expressed as expectations of 
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the form u(t, x) := E[g(X7)| X; = x]. Under mild conditions, the function u(t, x) 
is the unique classical solution of a partial integro-differential equation (PIDE). 
Unfortunately, closed form and even semi-closed form solutions of these PIDEs are 
available only in rare cases. As such, it is important to develop general methods for 
finding analytical approximations for the solutions of these PIDEs. 

Within the mathematical finance literature, a number of different approaches have 
been taken for finding approximate transition densities and option prices for markets 
described by Markov processes. Most of these techniques involve expansions that 
exploit a small parameter or a limiting case. For example, Benhamou et al. [1] develop 
analytical approximations for models with local volatility and Gaussian jumps in the 
small diffusion and small jump frequency/size limits (see also the recent review 
paper by Bompis and Gobet [2]). Deuschel et al. [5] obtain densities for diffusion 
processes in a small noise limit. Fouque et al. [6] find option prices for Black-Scholes- 
like multiscale models where volatility is driven by two factors, one running on a 
fast scale, one running on a slow scale. Lorig [11], Lorig and Lozano-Carbassé [12] 
extend these multiscale techniques to more general diffusions and to the exponential 
Lévy setting. 

Recently, Pagliarani and Pascucci [19] introduce a method for finding asymptotic 
solutions of parabolic PDEs. The approach, called the adjoint expansion method, is 
extended by Lorig et al. [15], Pagliarani et al. [21] to models with jumps and it was 
further generalized by Lorig et al. [13] to a family of asymptotic expansions for a d- 
dimensional market described by an Itó SDE (i.e., a Markov market with no jumps). 
The method consists of expanding the pricing PDE in polynomial basis functions, 
which results in a nested sequence of Cauchy problems, and deriving analytical 
solutions for these nested Cauchy problems. In this paper, we extend the results of 
Lorig et al. [13], Lorig et al. [15], Pagliarani et al. [21] to the PIDEs that arise when 
markets are described by a d-dimensional Lévy-Itó SDE. Results presented here also 
simplify results from Lorig et al. [13], Lorig et al. [15], Pagliarani et al. [21]. 

The rest of this paper proceeds as follows. In Sect.2 we present a general d- 
dimensional market model. We also describe the kinds of derivative-assets we wish 
to price, and we relate the price of such derivative-assets to the solution of a parabolic 
PIDE. In Sect.3 we introduce the idea of polynomial expansions of the pricing PIDE 
and in Sect. 4, we derive a family of analytical price approximations—one for each 
polynomial expansion of the pricing PIDE. Lastly, in Sect. 5 we provide a numerical 
example, illustrating the versatility and accuracy of our methods. 


2 Market Model 


We take, as given, an equivalent martingale measure Q defined on a complete 
filtered probability space (О, F, {F;,t > 0), Q). АП stochastic processes defined 
below live on this probability space and all expectations are taken with respect 
to Q. The risk-neutral dynamics of our market are described by the following d- 
dimensional Markov Lévy-type process 
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ах, = u(t, X) dt + o (t, X;) dW; «f zdN (t, X,.., dt, dz). 
Rd 


Here W is a standard m-dimensional Brownian motion, and N (.,-, dt, dz), given by 
N(t, x, dt, dz) = №, x, dt, dz) — v(t,x,dz)dt, — (rx) e R* x Rf, 


is a family of compensated Poisson measures on B(IR) & (В). The drift vector 
u and volatility matrix с map u : Ry x R^ > R^ апіс: R} x R > Вх", 
respectively. We assume the Lévy kernel v satisfies 


fmin [ra ier] (д) «oo,  $(d):- sup (и, х,о), Q.D 
R4 (tx) eR. x Rd 


which is rather standard for Lévy-type models. The components of X could represent 
a number of things such as e.g., economic factors, asset prices, indices, or functions 
of these quantities. We assume a risk-free interest rate of the form r(t, X;) where 
r : R} x R? > R. We also introduce a random time ¢, which is given by 


t 
сего: f уб, Хой > e]. y:R, x R > R,, 
0 


with € exponentially distributed and independent of X. The random time ¢ could 
represent the default time of an asset, the arrival of an economic shock, etc. 

Denote by V the no-arbitrage price of a European derivative expiring at time T 
with payoff 


Н(Хт) l>r) + С(Хт) ler; = (H(Xr) – С(Хт)) ligar) + G(X T). 


Itis well known (see, for instance, Jeanblanc et al. [9]) that 


y, = E [en '®®%б‹хтух,]+ 


Е [е 9x99» (nor - б(Хт))!Х‹], ет. 22) 


Thus, to value a European-style option, one must compute functions of the form 


ult, x) := ale JE 6,99 (Хг) | X, 2x]. (2.3) 


Under mild assumptions (see, for instance, Pascucci [22]), the function u, defined 
by (2.3), satisfies the Kolmogorov backward equation 


(д; + A(t) u = 0, u(T, x) = ф(х), x e Rd, (2.4) 
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where the operator A(t) is given explicitly by 
A(t) =| v(t, x, dz) (eos -1-(z v) 
R4 


2 а 
1 т 
т 2 P» (ve JN (t, х)д дк, + Уша, x)8y; — A(t, x), (2.5) 
LjJ= 


i=l 


with 
d 
(z x= ЭЕ Xi, Му t= (05,05, ..., дхи), e^ V3 f(x) = f(x + z). 
isl 


The formal representation of the shift operator еб“) is motivated by the fact that its 


Taylor expansion applied to the function f (x) gives the Taylor expansion of f (x +z) 
about the point x. As in Øksendal and Sulem [18, Chap. 1], we regard the domain of 
A(t) to be all functions f: R7 — R such that A(t) f (x) exists and is finite for all 
x € Rd. 


Remark 2.1 (Martingale property) Let us denote by X“ the ith component of ће 
vector X and assume that 


J e^ v(dz) < oo, 
|2|>1 


for some i < d, with v as in (2.1). If S, := lesne” is supposed to be a traded 
asset then, in order for S to be a martingale, the drift u; must satisfy 


шах) m ya) – | vss doe = 1— 2) = 5 (007) @,х), 
Ra 2 ii 


To see this, set H (x) = e", G(x) = 0 and impose V, = S, in (2.2). 


3 General Expansion Basis 
Let us start by rewriting the differential operator (2.5) in the more compact form 


A(t) := I, v(t, x, dz) (eine —1- (2, v.)) + У ag(t,x)D%, t€R, x e RA, 


la|z2 
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where by standard notations 
d 
а = (a1, ..., o4) € Nd, lal = У о, De = 9%... 9%, 
i=l 


In this section we introduce a family of expansion schemes for A(t), which we shall 
use to construct closed-form approximate solutions (one for each family) of (2.4). 


Definition 3.1 For |a| < 2 and n < N є No, let dan = aa n(t,x) and v, = 
Vn (t, x, dz) be such that the following hold: 


(1) Foranyt є [0, T], а, 4 (t, -) are polynomial functions with ago (t, x) = aa,0(t), 


and for any x € IR? the functions dg, n C, х) belong to L?*([0, T]). 
(1) For any t € [0, Т], x € R4, we have 


w(,x,dz)- >) x?un,a(t, dz), M,cNo (3.1) 
IB|E M, 


where each ъ„ g(t, dz) satisfies condition (2.1). Moreover, Mo = 0, vo > 0 and 
/ e*llyg(t,dz) «oo,  te[0,T], (3.2) 
|z|=1 


for some positive A. 


Then we say that (An (t))o<n<y, defined by 


Arfa У x? (a Vn p(t, dz) ee elei v) Лоа У dax D? f(x) 


IBI M, le|z2 


= ja wt, x, do) (e 99 — 1- (2, Ve) ло) + È nt: x)D* f (x), 


(3.3) 


is an Nth order polynomial expansion of A(t). 


Definition3.1 allows for very general polynomial specifications. The idea is to 
choose an expansion (A, (t)) that closely approximates A(t). The precise sense of this 
approximation will depend on the application. Below, we present three polynomial 
expansions. The first two expansion schemes provide an accurate approximation 
A(t) in a pointwise local sense, under the assumption of smooth coefficients. The 
last expansion scheme approximates A(r) in a global sense and can be applied even 
in the case of discontinuous coefficients. 
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Example 3.2 (Taylor polynomial expansion) Assume the coefficients ag (f,-) € 
C" (В) and that the compensator v takes the form 


v(t, x, dz) — h(t, x, z)v(dz) 
where A(t,-,z) € C" (RÀ) with Л > 0, and v is a Lévy measure. Then, for any 
fixed x є R and n < №, we define v, and ag,n as the nth order term of the Taylor 


expansions of v and a, respectively in the spatial variables x around the point x. 
That is, we set 


p = 
„бох, = V PACED — уба), 


! 
|Bl=n p 
Dydalt,%) — 
ал) = Ye - 3f. la| < 2. 
|Bl=n 
where as usual В! = fi!--- Ви! and x? = a e ae The expansion proposed 


in Lorig et al. [14, 17] is the particular case when v = 0, whereas the expansion 
proposed in Lorig et al. [15, 16] is a particular case when d — 1. 


Example 3.3 (Time-dependent Taylor polynomial expansion) Under the assump- 
tions of Example 3.2, fix a trajectory x : Ry — В“. We then define v, (t, х, dz) and 
dg, n (f, х) as the nth order term of the Taylor expansions of v(t, x, dz) and а, (t, x) 
respectively around x (f). More precisely, we set 


p = 
эф, x, dz) = M PACO DG EPa), 


|Bl=n А 
В - 
алх) = > m (x - (f, l| <2. 
|Bl=n І 


This expansion for the coefficients allows the expansion point x of the Taylor series to 
evolve in time according to the evolution of the underlying process X;. For instance, 
one could choose x(t) = E[X;]. In Lorig et al. [14] this choice results in a highly 
accurate approximation for option prices and implied volatility in the Heston [8] 
model. 


Example 3.4 (Hermite polynomial expansion) Hermite expansions can be useful 
when the diffusion coefficients are discontinuous. A remarkable example in financial 
mathematics is given by the Dupire's local volatility formula for models with jumps 
(see Friz et al. [7]). In some cases, e.g., the well-known Variance-Gamma model, the 
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fundamental solution (i.e., the transition density of the underlying stochastic model) 
has singularities. In such cases, it is natural to approximate it in some LP norm rather 
than in the pointwise sense. For the Hermite expansion centered at x, one sets 


v(t, x, dz) = У (Hg — 3), v(t, dz))rHg(x — X), 
\8|=п 
dan(t, x) = У (Hg — X), aa (t, ))rHg(x — X), la| < 2, 
\8|=п 


where the inner product (.,-)r is an integral over R? with a Gaussian weighting 
centered at x and the functions Hg(x) = Hg, (x1) -- - Hg, (ха) where Н, is the nth 
one-dimensional Hermite polynomial (properly normalized so that (Ha, Нв)г = 
às, gp With б,в being the Kronecker's delta function). 


4 Formal Solution Via Dyson Series 


In this section we present a heuristic argument to pass from an expansion of the 
operator A(t) in (2.5) to an expansion for и, the solution of problem (2.4). The 
following argument is not intended to be rigorous. Rather, the computations that 
follow provide motivation for the price expansion given in Definition 4.1. Throughout 
this section, we will generally omit x-dependence, except where it is needed for 
clarity. To begin, we presume that the operator A(t) can be formally written as 


A(t) = Ao(t) + BO), Bit) = >) As). (4.1) 


n=1 


We insert expansion (4.1) for A(t) into Cauchy problem (2.4) and find 
(9, + Ao(t))u(t) = —B(t)u(t), u(T) = 9. 


Note that, by construction, A(t) is the generator of an additive process. Therefore, 
by Duhamel’s principle, we have 


T 
u(t) = Pot, T)o «f dti Роб, t1) B(t)u(n), (4.2) 
t 
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where Po(t, Т) is the semigroup of operators generated by Ao(t). Inserting expres- 
sion (4.2) for u into the right-hand side of (4.2) and iterating we obtain 


T 
u(t) = Роб, Te + J dn Pott, BEP, Т)е 
t 


T T 
1 / ап f dro Polt, )B(G) Pon, t2)B(ta)u(t2) 
[4 ti 


= Pot, + | an f ар ·· f dt, 
t 


Polt, 1) B(t) Pot, t2)B(t2) - ·· Poltk-1, BU) Polte, T)o (4.3) 
= Polt, De«YY[ an f а - f dt, 
n=lk=1"! n 
>, Polt, t1) Ai, (t1) Po(ti, t2) Ain (t2) «++ 70(@—1. t) Ai, (t) Po (Фк. Т), 
іЄІ, к 
(4.4) 
Ing = {i = (й, in, ..., i € NÉ | iq Hin й = n). (4.5) 


The second-to-last equality (4.3) is known as the Dyson series expansion of u (see, for 
instance, Sect. 5.7 of Sakurai and Tuan [23] or Chap. IX.2.6 of Kato [10]). To obtain 
(4.4) from (4.3) we have used (4.1) to replace B(t) by the infinite sum bias 1 AÁn(t), 
and we have partitioned on the sum of the subscripts of the (A;,). Expansion (4.4) 
motivates the following definition. 


Definition 4.1 Fora fixed Nth order polynomial expansion (A, (ї))о<п<м satisfying 
Definition3.1, we define uy, the Nth order price approximation of u, as 


=! 
= 


N 
= >, Uns (4.6) 
n=0 


where 


uo(t) := Pott, T)¢, 


"NO -M an f. dh- [. dt, 
t 


>, Pott, ti).Ai (1)Ро(, t2) Ain (t2) ++ Po(te-1, th) Ai, (x) Po (tk, T), п> 1. 
i€ln,k 


(4.7) 


Here, Po(t, T) is the semigroup generated by Ao(t) and І, к is as given in (4.5). 
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In Sects.4.1 and 4.2 we will provide explicit expressions for uo and (un)n>1 
respectively. 


4.1 Expression for ug 


In what follows, it will be helpful to recall the definition of the Fourier and inverse 
Fourier transforms. For any function ф in the Schwartz class, we define 


Fourier transform: Ғф]() = ф(&) = | dx g(x)ei 69, 
R4 


1 


Inverse transform: ![G](x) = ф(х) = Ол) 


| dé $e 369. 
Rd 


Recall that by construction Mo = 0 (cf. Definition3.1) and therefore the operator 
Ao(t) has time-dependent coefficients which are independent of x. Then the action 
of the semigroup of operators Po(t, T) of Ag(t) is well-known: 


1 ^ ^ 
ug(t) := Pott, Т)ф = ao |, Pott, x, T, &)ф(—ё) dé (4.8) 
(2л) Јра 
where 

Po(t, x, T, E) = ei Xt P07.) (4.9) 

with 

T 
Pot, T, ё) = Ya | ds ao, o(s) + Vo(t, T, £), (4.10) 
PES А 
апа 


Т . 
Volt, T, £) EL n Gi —1—1{#, 3) vo(s, dz)ds. 
t Rd 


Remark 4.2 We introduce P and eg, the characteristic function and oscillating expo- 
nential, respectively 


^ T D z 
Ê(t, x, Т, E) := [е анаа) eg(x) = ei, (411) 


where ао,о is short-hand for ако о,....0),0- From (2.3) we observe that Pct, x,T,é&)is 
obtained as the special case p = ez. We note that Po (t, x, T, €) in (4.9) represents the 
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Oth order approximation of P (t, x, T, £). More generally, we denote by Р, (t, x, T, &) 
the nth order approximation of P (t, x, Т, ё), obtained by setting ф = eg in (4.7). 


4.2 Expression for иһ 


Remarkably, as the following proposition shows, every u; (t) can be expressed as a 
pseudo-differential operator £, (t, T) acting on uo(t). 


Proposition 4.3 Assume that ф belongs to the Schwartz class, and that Фо in (4.10) 
is a smooth function of the variable &. Then the function иһ defined in (4.7) is given 
explicitly by 
Un(t) = Ly(t, T)uo(t), (4.12) 
where ug is given by (4.8) and 
Си, Т) = > i dr a di- [. di У Ga (1, 1)Gig(t, n) Ga ts te), 
t i€ln,k 
(4.13) 
with I, k as defined in (4.5) and 
Gj (t, tk) = Aj (tk, M(t, t) 
= I. vj (tes M(t, t), dz) (e 9 — 1 = (2, Vad) + У а, M(E, n) DY, 


le] x2 
(4.14) 
tk tk 
M(t, tk) =+ f. [6 ) 6. ад f тов + f C(s)Vxds, 
t 
(4.15) 
m(s) = (a(1,9,...,0),0(8) | a(0,1,.,0,0(5) -> 4@,0,....1),0(8)) . 
2a(2,0,.,0,0($) | @(1,1,...,0),008)  ...  4(0,0,..,1),0(5) 
4(,1,..,0,069)  2a(0,2,..,0,0(9) |... — 4(0,1,., 0,05) 
C(s) — | : . . 
Q(1,0,....1),008) 90,1,...,1),005) ... 2a(0,0,..,2),0 5) 


Moreover, the components of M (t, tk) commute. Therefore the operators (0 ; (t, tk)), 
which are polynomials in M(t, ty) by construction, are well defined. 


Proof The proof consists in showing that the operator G ; (t, t) in (4.14) satisfies 


Polt, tr Aj (th) = G; (t, th) Pott, tk). (4.16) 
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Assuming (4.16) holds, we can use the fact that Po (tx, tk+1) is a semigroup 
Pott, T) = Pott, t1)Po(th, t0): --Po(ty i, t Po(t, Т), <<< zT, 
and we can re-write (4.7) as 
п „Т Т Т 
“=> f% [ diy / ‚% 27 6,086,106, 0Р0, 
= = їЄ1, к 


from which (4.12) and (4.13) follows directly. Thus, we only need to show that 
C j (t, tk) satisfies (4.16). Itis sufficient to investigate how the operator Po (t, tk) Aj (tx) 
acts on the oscillating exponential in (4.11). First, we note that 


Polt, tees (x) = ето e (у), (4.17) 


where Фо (т, tg, 5), as given in (4.10), is a smooth function by condition (3.2). Next, 
we observe that the operator M (t, tg) in (4.15) can be written 


ма, tk) = M(t, tk, —iV,), M(t, tk, £) = —iVz (ol, tk, £) F i(&, x)) d 
(4.18) 


Denote by М; and М; the jth component of М and M respectively. Then, using 
(4.18) we have 
(—i9g)C—-i9g,)e 93 e; (x) = (~ig) M; (t, tk, £)e P es (x) 

= M j(t, tj) 18g oP De, (x) 

= Mj (t, te) Мій, th, &)e e eg (x) 

= Му Milt, tetes (х). (4.19) 
More generally for any multi-index В we have 

(—iVe Pee Dee (x) = (M(t, д) гео te Dee (x). (4.20) 

From (4.19) we deduce that operators M; and М; commute when applied to 
е®о@ к. Neg (x), because so do dg, and дЕ,. Consequently, M; and Му also com- 
mute when applied to eg (x) or any function that admits a representation as a Fourier 


transform. To see this observe that 


M j(t, t) Mi (t, per Peg (x) = Milt, t) M jt the e (у). 
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Therefore, since М; (t, tk) acts on x and not ё we have 


Mj c t ML. peg (х) = Mi t, tk)M j t, tk)eg (х). 


Finally, we compute 


Polt, te) Aj (teg (x) = Polt, tj) f vj, do) (e 9) — 1 — (z, Уу))ер (д) 


+ У; Polt, th )der, j (tk, x) D% eg (x) 


Jo] <2 


= Рой, wf (еї (8) — 1 — i (z, £))vj (te, x, dz)eg (x) 
ра 


lo| x2 
lo| x2 


le|x2 


а ы -1-i 
R 


101<2 


101<2 


101<2 


= Gj (t, tk) Pott, 1 )е (x), 


which concludes the proof. 


+ У GEY'Po(t, th) aa, j (th, x)eg (X) 


= pce —1— i(z, £))vj (tk, —i Vg, dz) Pott, mee (x) 


+ >) GEM as j (te, CiVEYPo(. teg (x) 


= js —1— ilz, £)) vj (tg, Ci Vg, dz)e Poe De, (x) 


+ У GEM ag, j (t, Ci Vg )e Tote De, (x) 


z, £))vj (te, M (t, ty), dz)e Po (ie, (x) 


+ Y GE as j (th, ME, eE e, (x) 


= fa vj f, M(E, tk), dz) (e^ V9) — 1 — (а, У) е0 Hes (x) 


+ У as jk M(t, t) De T0 Cie Dep (x) 


= p vj (te, M, tk), dz) (e^ V) — 1 — (z, Vx) Pole, thee (x) 


+ У: aa, j (th, M(t, tk)) DE Polt, thee (х) 
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(by (3.3)) 


(by (4.17) 


(by (4.20)) 


(by (4.17)) 


(by (4.14)) 


Remark 4.4 Error bounds for the Taylor approximation и у in the scalar case d = 1 


can be found in Lorig et al. [15, 16]. 
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4.3 Fourier Representation for иһ 
Using (4.8), (4.9) and (4.12) we have 


us(t, x) = Ly(t, T)uo(t, x) = 


| Ф.Т) ia oc 
ax fp (Ent, Te") eot. 


The term in parenthesis £,(t, T)e? 6? can be computed explicitly. However, 
L£n(t, T) is, in general, an integro-differential operator (when X is a diffusion 
Ln (t, T) is simply a differential operator). Thus, for models with jumps, computing 
L£y(t, Teix isa challenge. Remarkably, we will show that there exists a first order 


differential operator Ёё (t, T) such that 
£2 (t, T)ei 69 = Let, T)e1 6, (4.21) 


where, for clarity, we have explicitly indicated using superscripts that 7; (t, Т) acts 
on x and Ёё (t, T) acts оп &. With a slight abuse of terminology, we call £s the 
symbol! of the operator Сх (t, T) in (4.13). 

Let us consider the operator M*(t, tk) = M(t, tk) in (4.15) and denote by 
M7 (t, tx) its ith component. The symbol мМ? (t, tk) of M7 (t, tk) is defined analo- 
gously to (4.21), that is 


Mi (t, tE = ME t, met em, 
Explicitly, we have 
ME (t,t) = Fig, t t) — ids,  i=1,...,d, 
where the function F is defined as 


tk : tk tk 
F; (E, t, f) E f 5 (eae = 1) n. дё + f "вй i f (COE); ds. 
R4 Jt t t 


We note that, while МХ” is a first order integro-differential operator, its symbol ME 
is a first order differential operator. For this reason, it is more convenient to use the 
symbol М instead of the operator M*. Note also that 
MEG, MIC, DEE” = MEG, MEE, tE 
= MS, t) ME, et 6? 
= MS, t) MI ше, 


l The operator £i is nota function as in the classical theory of pseudo-differential calculus. However 
e- i63) RE eix) is the symbol of £} (t, T). 
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Since M7 and ME commute when applied to a function that admits a Fourier rep- 
resentation, then М апа At also commute when applied to such functions. In 


particular, the operator (ME (t, ))° ,forfe Nd, is well defined and we have 


(A(t, 1))^ e$ €9 = (M(t, P еї. (4.22) 


From identity (4.22) we obtain directly the expression of the symbol of G; in (4.14). 
Indeed, recalling the expression (3.1) of v; we have 


бал - У, 3 (е — 1 — ile, £)) via Ge dz) (MFC. д)" 


IB| Mj 


+ У GE)" as; (tk, M (ng). 


012 
Thus we have proved the following lemma. 
Lemma 4.5 We have 
п pT Т Т 
£i. Qe an f if СЭ "IR йөз AC) 
= zd іЄ1, к 


(4.23) 


where Ij. as defined іп (4.5). 


The following theorem extends the Fourier pricing formula (4.8) to higher order 
approximations. 


Theorem 4.6 Under the assumptions of Proposition4.3, for any n > 1 we have 
1 5 
Un(t) = —— | Palt, x, T, €)e(—&) dé, (4.24) 
(2zx)* Jipa 


where P, (t, x, T, €) is the nth order term of the approximation of the characteristic 
function of X (cf. Remark 4.2). Explicitly, we have 


PB, (t,x, T, £) := Вб, x, T,£) NC pee 


where Bot, x, T, &) is the Oth order approximation in (4.9) and Ёё (t, T) is the 
differential operator defined in (4.23). 


Asymptotics for d-Dimensional Lévy-Type Processes 335 


Proof We first note that, since the approximating operator СХ acts in the x variables, 
then it commutes? with the Fourier pricing operator (4.8). Thus, by (4.12) combined 
with (4.8), we get 


un(t) = £X(t, Tuo(t) = a fa LE(t, Tyei 9-99. 6 е) dé 


Qd 


1 
= сг} Pot, x, T, (e FE) Lice, Tet 69) Gg) а, 


and the thesis follows from (4.21). 


Remark 4.7 Computing the term in parenthesis above (eis " MEE (t, T)e1 6 =) is 


a straightforward exercise since the symbol Ёё (t, T), given in (4.23), is a differential 
operator. 


Remark 4.8 In case of non-integrable payoffs (e.g. Call and Put options), the Fourier 
representation (4.24) can be easily extended by considering the Fourier transform 
on the imaginary line ё = &, + 1ё{. For instance, since the Call option payoff 
ф(х) = (е^ — ek ) is not integrable, its Fourier transform Q(—£) must be computed 
in a generalized sense by fixing an imaginary component of the Fourier variable 


& « — 


Remark 4.9 Observe that the Nth order approximation (4.6)-(4.24) requires only a 
single Fourier inversion 


bids wc apt. Ê, (t, x, T, E)E) di. 


n=0 


Moreover, when evaluating the inverse transform, the number of dimensions over 
which one must integrate numerically is equal to the number of components of x that 
appear in the option payoff o. This is due to the fact that the Fourier transform of a con- 
stant is a Dirac delta function. In particular, let g(x) = ф(х) with x = (x1, ..., ха), 
for some d' < d. Then we have ф(&) = (27)4~“'@ (E ) 89(&g.1) ++ 89 (64). with 


E = (&,..., Ey), and thus 


(ta) = c ay Èf’ (t. x. T. (ЕЁ, 0) 6 (-E) a£. 


?This was one of the main points of the adjoint expansion method proposed by Pagliarani et al. 
[21]. 
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5 Example: Heston Model with Stochastic Jump-Intensity 


Consider the following model for an asset $ = е^ 


Q assuming zero interest rates 


, Written under the pricing measure 


1 T 
dX, — 757 (04) (е — 1 — ¢) | Zdt + V ZdW, + | сам, Z;, dt, dz), 
R R 
dZ; = k(0 = 2.) + ôy ZidB;, d(W, B) = pdt. 
Note that, just as in the Heston model, the instantaneous volatility of X is given by 
М Zi, where Z is a CIR process. Likewise, the instantaneous arrival rate of jumps of 


size dg is given by Z;v(dz), where v is a Lévy measure satisfying all of the usual 
integrability conditions. The generator A of the process (X, Z) is given by 


1 
A=e(u + 592 [овое -1- cao) 
1 
+к(0 — әд: + 5 207 + pôzðxðy, 


1 
= 5 -f v(dz)(e* — 1 — ¢). 
R 


The characteristic function Pct, х,2,Т,ё):= pets Хт |X; = x, 2 = z] is obtained 
in Carr and Wu [3] by expressing the process X as a time-changed Lévy process. 
One can also obtain the characteristic function by solving for the Fourier transform 
of the fundamental solution corresponding to the operator (д, + A). We have 


Pra T E) espe rr HERE Ne 


0 {= d(&)t 
ceo = (e - oit ca zl m |) 


k —püi£-pd(E) 1- e16 
5° 1— f(£)e? t" 

k — psig + d(&) 

к — pói& —d(é)’ 


dE) = y —81 24 (E) + (к — pi£8,, 
v) = in - hs [ vapelt -1- ie. 


D(t, £) = 


fE) = 
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With an explicit expression for P (t, x, z, T, &) available, the price of a European call 
option can be computed using standard Fourier methods 


1 X . А —ek-ikė 
u(t, x, z) = F P(t, x, z, Т,&)ф(—#), Ф@) = ЧЕБЕР 
E=% +16, §& «-—l. (5.1) 


Note that, since the call option payoff g(x) = (e* — e*)+ is not in L! (R), its Fourier 
transform Q(£) must be computed in a generalized sense by fixing an imaginary 
component of the Fourier variable &; < —1. 

Also of interest are sensitivities of option prices or Greeks. In particular, consider 
the ^ and the Г, which are defined as 


A(t, x, z) := Osu(t, x(s), z) =e "O.u(t, х, zZ), (5.2) 
T(t, x, х) := 82u(t, x(s), z) = e ?* (8? — 8,)u(t, x, z), (5.3) 


where we have used x(s) = log s. When computing terms of the form д u(t, x, z), 
observe that the differential operator 0” acts only on the characteristic function P 
appearing in (5.1) and not on the Fourier transform ф of the payoff o. Likewise, when 
using Theorem 4.6 to compute 07" u, (t, х, 2) = > 07 u; (t, x, z) the differential 
operator д," acts only on Ё, in (4.24). 

Now, we specialize to the case where jumps are normally distributed 


A (e) 
Jma m Op 


In Fig. 1 we plot the implied volatility o corresponding to the exact price u as well 
as the implied volatility о» corresponding to our second order approximation u2. 
To compute o we first compute option prices using (5.1); we then invert the Black- 
Scholes equation numerically in order to obtain the implied volatility o. To compute 
our second order approximation of implied volatility o» we first compute our second 
order approximation for prices u2 using Theorem4.6; we then invert the Black- 
Scholes equation numerically in order to obtain с». Values from Fig. 1 can be found 
in Table 1. In Fig. 2 we plot the exact A as well as our second order approximation ^2. 
In Fig.3 we plot the exact Г as well as our second order approximation I’). Values 
from Figs. 2 and 3 are given in Tables 2 and 3 respectively. Exact Greeks are com- 
puted by combining (5.1)-(5.3). Approximate Greeks are computed by combining 
Theorem 4.6 and Eqs. (5.2) and (5.3). 


v(dg) = 
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t — 0.10 t—0.25 
0.28 + 0.26 р 
0.26 - 0241. 
0.24 | 
0.22 } 
0.22 + 
0.20 P , 
/ 
j 
П ii [^ 
-02 -0.1 E Fi 
E -0.1 0.1 "T7603 
t = 0.50 = 1.00 
0.26 
0.24} 024 . 
0.22 S 0.22 
0.20) 020 7 
m ў 
` ; 
IS / i 
1 | 1 | п 4 | -0.4 -0.2 3 04 
-03 -0.2 -0.1 0.1 -Q2 03 RUE 
Р A —(¢€ —m)? 
и(44) = exp = : 
v2ns? 282 


Fig. 1 For the model considered іп Sect. 5, we plot the implied volatility ø corresponding to the 
exact option price и (solid black) as well as the implied volatility o? corresponding to our second 
order option price approximation их (dashed black). The units of the horizontal axis are log strike 


k := log К. Approximate prices are computed using the Taylor series expansion of A(t) as described 
in Example 3.2. We assume the Lévy measure v is as parametrized above. The following parameters 
are used in all four plots: к 1.15, 0 0.04, ô 0.2, р 0.7,z = Ө, х = 0, m = —0.1, 
5 = 0.2, А = 2.0 


Table 1 Exact implied vols с, second order approximation o» and relative error |(02 — о) /о| 


t — 0.10 


0.2 


0.15 


0.1 


0.05 


0.2797 


0.2478 


0.2269 


0.2133 


0.00 
0.2028 


0.05 
0.1940 


0.1 
0.1881 


0.15 
0.1960 


0.2 
0.2296 


rel. err. 


0.2795 
0.0006 


0.2483 
0.0018 


0.2271 
0.0009 


0.2132 
0.0003 


0.2028 
0.0002 


0.1939 
0.0001 


0.1877 
0.0020 


0.1963 
0.0018 


0.2324 
0.0120 


ї= 0.25 


o 


0.2441 


0.2323 


0.2217 


0.2120 


0.2028 


0.1941 


0.1863 


0.1805 


0.1803 


02 


0.2456 


. | 0.0059 


0.2328 
0.0018 


0.2215 
0.0013 


0.2116 
0.0020 


0.2025 
0.0013 


0.1939 
0.0009 


0.1859 
0.0021 


0.1793 
0.0067 


0.1799 
0.0027 


t= 0.50 


0.2348 


0.2266 


0.2183 


0.2101 


0.202 


0.1940 


0.1864 


0.1796 


0.1743 


0.2350 


0.2254 


0.2168 


0.2088 


0.201 


0.1933 


0.1856 


0.1783 


0.1723 


. | 0.0005 


0.0049 


0.0069 


0.0063 


0.004 


0.0037 


0.0040 


0.0070 


0.0116 


t= 1.00 


0.2268 


0.2204 


0.2138 


0.2072 


0.2005 


0.1939 


0.1875 


0.1813 


0.1757 


rel. err. 


0.2217 
0.0227 


0.2149 
0.0246 


0.2089 
0.0230 


0.2031 
0.0197 


0.1973 
0.0160 


0.1914 
0.0130 


0.1854 
0.0111 


0.1794 
0.0103 


0.1740 
0.0096 


Parameters are the same as those in Fig. | 
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t — 0.10 t = 0.25 
Lor Lor 


0.8 F 0.8 F 


-0.3 -0.2 —0.1 0.1 0.2 0.3 04 -0.2 0.2 04 


Fig.2 For the model considered in Sect. 5, we plot the Delta A corresponding to the exact option 
price u (solid black) as well as the Delta A» corresponding to our second order option price 
approximation из (dashed black). The units of the horizontal axis аге x. Approximate prices are 
computed using the Taylor series expansion of A(t) as described in Example 3.2. We assume the 
Lévy measure v is as given in Fig. 1. The following parameters are used in all four plots: к = 1.15, 
Ө —0.04,8 = 0.2, р 0.7,2= 0, к= 0, т 0.1, s = 0.2, à = 2.0 


340 


M. Lorig et al. 


-0.3 


-02 


-0.1 


-0.4 


-0.2 


0.2 0.4 


Fig.3 Forthe model considered in Sect. 5, we plot the Gamma Г corresponding to the exact option 
price u (solid black) as well as the Gamma Гә corresponding to our second order option price 
approximation из (dashed black). The units of the horizontal axis аге x. Approximate prices are 
computed using the Taylor series expansion of A(t) as described in Example3.2. We assume the 
Lévy measure v is as given in Fig. 1. The following parameters are used in all four plots: к = 1.15, 


Ө = 0.04, 6 


0.2, p 


0.7, = 


Ө, k 


0, т 


0.1, s 


0.2, А 
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Table 3 Exact Gamma Г, second order approximation Гә and relative error |(Г2 —Г)/Г| 

x 0.2 0.15 | —0.1 0.05 000 0.05 |01 0.15 |02 
t=0.10| Г 0.01828 | 0.2978 | 2.159 | 5.539 | 6.288 | 3.831 | 1.446 |0.3779 | 0.0780 
T2 0.01197 | 0.2897 | 2.1760 | 5.5300 | 6.288 | 3.841 | 1.437 |0.3748 | 0.0821 
rel. err. | 0.3452 | 0.0273 | 0.0077 | 0.0015 | 0.0001 | 0.0025 | 0.0061 | 0.0082 | 0.0518 
t—025|T 0.5185 |1705 3.337 |4.275 13.967 | 2.884 |1.738 |0.906 | 0.4229 
T2 0.5267 |1747 |3.334 |4.255 |3.969 |2.907 |1.754 |0.8925 | 0.4016 
rel. err. | 0.0157 |0.024 | 0.0009 | 0.0046 | 0.0003 | 0.0079 | 0.0094 | 0.0149 | 0.0503 


t=0.50| Г 1.514 |2488 |3135 |3.206 |2.802 |2.174 |1.54 |1.017 |0.635 
T» 1.585 2.508 |3.109 |3.182 |2.804 2.208 |1.588 |1.045 | 0.6244 
ге]. err. | 0.0468 | 0.0079 | 0.0081 | 0.0076 | 0.0007 | 0.015 | 0.0309 | 0.0279 | 0.0167 
t= 1.00 Г 2.005 |2425 |2483 |2.306 |1.985 |1.612 |1.251 | 0.9364 | 0.6814 


T2 2.134 |2.418 |2452 |2.280 |1.988 |1.656 |1.331 |1.028 |0.7511 
rel. err. | 0.0183 | 0.0032 | 0.0124 | 0.0110 | 0.0015 | 0.0276 | 0.0644 | 0.097 | 0.1023 


Parameters are the same as those in Fig. 3 


6 Conclusion 


In this paper we derive a family of asymptotic expansions for European option prices 
when the underlying is modeled as a d-dimensional time inhomogeneous Lévy-type 
process. By combining the classical Dyson series expansion with a novel polynomial 
expansion of the generator, we obtain two equivalent representations for approximate 
option price: (i) as an integro-differential operator acting on the order zero price, and 
(ii) as a Fourier transform. We implement our pricing approximation on a Heston- 
like model which allows for both stochastic volatility and stochastic jump intensity. 
We find that our second order expansion provides and excellent approximation for 
prices (as seen through corresponding implied volatilities), as well as for the Greeks 
А and Г. 
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Asymptotic Expansion Approach in Finance 


Akihiko Takahashi 


Abstract This paper provides a survey on an asymptotic expansion approach to 
valuation and hedging problems in finance. The asymptotic expansion is a widely 
applicable methodology for analytical approximations of expectations of certain 
Wiener functionals. Hence not only academic researchers but also practitioners have 
been applying the scheme to a variety of problems in finance such as pricing and 
hedging derivatives under high-dimensional stochastic environments. The present 
note gives an overview of the approach. 


Keywords Asymptotic expansion - Derivatives * Option pricing - Hedge : Greeks - 
Stochastic volatility - Interest rate - Term structure model + Malliavin calculus - 
Watanabe theory 


1 Introduction 


Let (Q2, F, (-1]:epo,r], P) denote a probability space with filtration, on which a r- 
dimensional standard Wiener process W is defined, where P is an appropriate pricing 
measure (a risk neutral measure) in finance, and Т denotes some positive constant. 
Now, let F (w) be a Wiener functional and then V, the security or portfolio value can 
be expressed as V = E[F (w)] under certain conditions. Evaluating this expectation 
is one of the main issues in finance. Moreover, if Е depends on the parameter 0, 
computation of oy = SEF (ш; 0)], the sensitivity of the security value with respect 
to the change in this parameter (so called Greeks) is also an important task in practice. 
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As an example, let us consider a d-dimensional diffusion process X“ which is 
obtained as a strong solution to the stochastic differential equation; 


ax? = Vo(X®, ddt + У(Х, 9aW,, t e [0, T]; XO = хо, 


where є € [0, 1] is a known parameter. Here, the coefficients are assumed to sat- 
isfy some regularity conditions. In finance, many problems of pricing derivatives 
and evaluating the portfolios in investment theories are reduced to the problems of 
computing ELF (X51. the expectation of FO. that is a function of хе. 

In finance applications, it is important to deal with not only a smooth function 
f (x) but also non-smooth one. For example, when various options are evaluated, 
f is expressed as f = T o g, where T (x) = max{x, 0} and g stands for a smooth 
function of R7 +> R. In general, it is difficult to represent this expectation explicitly 
except for special cases. Hence, numerical methods such as Monte Carlo simulations 
or numerical solutions of partial differential equations (PDEs) are employed and 
various speeding up techniques are developed, since fast and precise computation is 
required in practice. 

As а different approach, an approximation of the expectation by an asymptotic 
expansion of the stochastic differential equation around e — 0 may be considered. 
Furthermore, because oec El f OX Г 1 апа EI f(X t ) )], the sensitivities of the secu- 
rity value with respect to the changes in the initial value xo and in the parameter є are 
important indicators for practical purposes, the approximations with high accuracies 
are so valuable. Moreover, some schemes that combine Monte Carlo simulations 
with asymptotic expansions with low orders are developed, since the asymptotic 
expansion up to the first or second order can be easily evaluated. Those schemes are 
able to improve the efficiencies of Monte Carlo simulations and the accuracies of 
approximations obtained by the asymptotic expansions. 

An asymptotic expansion approach in finance has been developed for the past two 
decades, which is mathematically justified by Watanabe theory (Watanabe [111]) 
in Malliavin calculus (e.g. Malliavin [64], Chap. V-8 in Ikeda and Watanabe [39], 
Nualart [72]). To the best of our knowledge, the asymptotic expansion technique is 
firstly applied to finance for evaluation of average options that are popular deriva- 
tives in commodity markets. Kunitomo and Takahashi [48] and Takahashi [85] derive 
approximation formulas for average options by an asymptotic expansion method 
based on log-normal approximations for average prices distributions, when the under- 
lying asset prices follow geometric Brownian motions. Yoshida [119] derives an 
asymptotic expansion of an average option price around a normal distribution for a 
general diffusion model, which is a byproduct of his result in statistics [118] based 
on the Watanabe theory. 

Thereafter, the asymptotic expansion approach have been applied to a broad class 
of valuation problems in finance, which includes pricing options with stochastic 
volatility models, pricing options under Heath-Jarrow-Morton (HJM) models [37] 
or Libor market models (LMM) (Brace, Gatarek and Musiela [7], Jamshidian [43]) 
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of interest rates, and pricing so called exotic-type options such as basket and barrier 
options in addition to average options. 

For instance, please see Kawai [44], Kobayashi, Takahashi and Tokioka [45], 
Kunitomo and Takahashi [49—51], Li [59], Matsuoka, Takahashi and Uchida [66], 
Muroi [67], Nishiba [71], Osajima [75], Shiraya and Takahashi [78—80], Shiraya, 
Takahashi and Toda [81], Shiraya, Takahashi and Yamada [83], Shiraya, Takahashi 
and Yamazaki [82], Takahashi and Matsushima [88], Takahashi and Saito [89], Taka- 
hashi and Takehara [90—94], Takahashi, Takehara and Toda [90, 91], Takahashi and 
Tsuzuki [98], Takahashi and Uchida [99], Takahashi and Yamada [100-104], Taka- 
hashi and Yoshida [106, 107], Takahashi and Takehara [92, 93], Violante [110], Xu 
and Zheng [112, 113], and Takahashi [86, 87]. 

We briefly introduce some of above works in Sect. 3.6. Moreover, we remark 
that the asymptotic expansion approach is employed by Yamanobe [116, 117] in 
physics for analyses of the impulse-driven stochastic biological oscillator and global 
dynamics of a stochastic neuronal oscillator. 

We also note that there exist many other types of the expansion/perturbation 
methods which have turned out to be so useful for applications in finance. For exam- 
ple, see Bayer and Laurence [2], Ben Arous and Laurence [3], Benaim, Friz and 
Lee [4], Col, Gnoatto and Grasselli [9], Davydov and Linetsky [11], Deuschel, Friz, 
Jacquier and Violante [12, 13], Forde and Jacquier [18], Forde, Jacquier and Lee [17], 
Foschi, Pagliarani, Pascucci [19], Fouque, Papanicolaou and Sircar [20, 21], Fujii 
[24], Fujii and Takahashi [25—27, 29], Gatheral, Hsu, Laurence, Ouyang, and Wang 
[30], Gnoatto and Grasselli [31], Gulisashvili [32], Hagan, Kumar, Lesniewski and 
Woodward [33], Henry-Labordére [38], Kato, Takahashi and Yamada [46, 47], 
Kusuoka and Osajima [57], Lee [58], Lipton [60], Linetsky [61], Osajima [76], 
Pagliarani and Pascucci [77], Siopacha and Teichmann [84], Yamamoto, Sato and 
Takahashi [114], Yamamoto and Takahashi [115], and references therein. 

The organization of the paper is as follows. The next section describes the outline 
of the asymptotic expansion approach in a general diffusion setting. Then, Sect.3 
explains a computational scheme for the expansion method. Section4 provides an 
extension of the general computational scheme in the previous section, and Sect. 5 
briefly introduces two improvement scheme for the expansion method. Section б 
extends the approach to non-diffusion Wiener functionals by using an instantaneous 
forward rates model as an example. Sections 7 and 8 introduce an asymptotic expan- 
sion in jump-diffusion models and a perturbation scheme in forward backward sto- 
chastic differential equations (FBSDEs). Section9 concludes. 


2 Asymptotic Expansion in General Diffusion Setting 


Following [87, 96], this section briefly describes an asymptotic expansion method 
in a general diffusion setting. 
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Let us consider a d-dimensional diffusion process xe = Qe T — ХИ) 
which is the solution to the following stochastic differential equation: 


dx? = Vd (xO, gar + eVi(X)dW, (j =1,...,4) (1) 


x = x0 € К, 


where W = (W!,..., W”)! isa r-dimensional standard Wiener process, and є є 
(0, 1] is a known parameter. Here, x! denotes the transpose of x. Next, let us define 
Vo = (,..., v : R? x (0, 1] & R? and V: R? > R? & R” whose jth row 
is VJ, j = 1,..., d. Suppose also that Vo and V satisfy some regularity conditions. 
(For example, Vo and V are smooth functions with bounded derivatives of all orders.) 

Next, let a function g : R” > R be smooth and all of its derivatives have 
polynomial growth. Then, a smooth Wiener functional g(X 5 ) has its asymptotic 
expansion: 


gx?) ~ gor + єт + ёфт + 


in D” as є |, 0 where дот, gir; gor. ... € D”. For any k є №, д є (1, оо) and 
s > 0, this expansion means that 


1 TUR 
a lax) — (gor + egir ++ gii r)llgs = OC) (as € { 0), 


where |[G |14,5 represents the sum of L?-norms of Malliavin derivatives of a Wiener 
functional G up to the sth order. Further, a Banach space D, , = D; ,(R) can be 
regarded as the totality of random variables bounded with respect to (q, s)-norm 
Il - |5, and DY = 59124 <o Dg,s. The coefficients gir € 0° (п = 0, 1,...) in 
the expansion can be obtained by Taylor's formula and represented based on multiple 
Wiener-Itó integrals. For the details of definitions and proofs above, please consult 
Watanabe [111], Chap. V of Ikeda and Watanabe [39], Malliavin [64], or Chap. 7 of 
Malliavin and Thalmaier [65]. 


Remark 1 As ап example of applications in finance, X consists of n stocks, X? = 
(S(9, ..., ST and g(-) is those weighted sum g(x) = шуху +--+ + WnXn for 
x = (x1,...,%)! with constant weights wj(i = 1,...,n). Then, g(x) would 
represent the spread, the average or the basket price of the stock prices. 

As another example, we can set X (9 is a vector of N forward Libor rates, Х © = 
(OP city and 


N-1 1 

1- П — 
=0 (є) 
1 l+rL it 


N-1 rji 
Tio IT; 


1 
j=0 (© 
. тт 


, 


(6) (є) 
90Х7 ) = SRy = 


that is a swap rate with inception date Т and maturity date Ту = Т + Мт. Here, 
L jr stands for the forward Libor rate at T fixing at T + jr with tenor 7. 
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ak y (©) F 
Let Ак = bo Lo and Ai, j =1,...,d denote the jth elements of A;;. In 


particular, A1; is represented by 
: 1 0 0 
Au = | Ya! (avs au + vaca). Q) 
0 


where Y denotes the solution to the ordinary differential equation: 
dY; = OVo(X, 0)¥,dt; Yo = Ia. 


avi (x,€) 
Du 


Here, OVo denotes the d x d matrix whose (j, k)-element is Ox Vi = : vý 
is the jth element of Vo, and Jy denotes the d x d identity matrix. 


Fork > 2, А], j = 1, ... , d is recursively determined by the following equation: 


1 f'.5 
AL--- T okvi (XO, 0)du 
k! Jo 


k 0 в 
1 1 quiim. e RNC" 
TE (k — D! au. I] Аг | 95,2 Vo (X. 0)аи 
1=17, 3 М JO en L 
13,48 J 
V 1 ' а dj Bi 
+2; Bt [ П Ар, 85 VÀ QGP)aW,, (3) 
i3,dg j=l | 
|] a! gl P 
where д. = 37. 85, Bum 
@) 1 
= 2, " 
їз. В=1 Тзє1 4 dae(1,....d)? 
forl > 1, 
B 
Lig i= 4Mg = (ly... , 1a); E -L(,l;,8eN). (5) 
j=l 
and for / = 0, 
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Then, gor and gır can be written as 
0 
gor = А y 


9т = 3075078 
j-l 


where 0; g(x) = 22:90. j=1,...,d. 
For n > 2, gnr is expressed as follows: 


(n) 
B d; 
9пт = p» 3 XPA -- Apr (6) 
I5,do 


Here, we note that each А! (i =1,...,d,1=1,2,...,k,0 < t x T) has all 
finite moments due to a grading structure. We describe the definition of the grading 
structure by following pp. 45—47 in Bichteler, Gravereaux and Jacod [5]: Consider 
the stochastic differential equation of the form: 


dS; = u(S;, t)dt + 0(5;, t)dW;; So = so € К“, (7) 


where u : R? x R* — R^ апіс: В x ВТ > R^ & R'. 


Definition 1 A grading of R? is a decomposition R? = R@ x ... x R^ with 
d = dı +--+ + dg. The coordinates of a point in R4 are always arranged in an 
increasing order along the subspace R, and we set Мо = 0 and M; = di +--- +d, 
for 1 < L < q. We say that the coefficients p and с are graded according to the 
grading R? = R^! x ... x R% if и! (х, t) and oi (x, t), j = L...,r depend upon 
only through the coordinates (хк) км, when Мр] <i < My. 
Theorem 1 We assume that the coefficients р and с in (7) are graded according to 
R? = R^? x. -» x R^. Moreover for F(x,t) = u(x,t) oraj(x,t), j =1,...,7, 
we assume that F is differentiable in x on R and 

|F (0, t)| < Zi fori =1,...,d 
: pA z P x, t)| € Z3 xi forall, J 


3; рі, D|zGifMpy.izxi,j < Mp for some p < q, 


where С, 0 > 0 are constants, and Z, Z are predictable processes such that ||Z || p 
1/р 

апа |2 ||, are finite for all р > 1 where 121, = {Jo E паа) ‚ Then (7) have 

a unique solution S, and for every p = | there are constants ср and үр depending 

only upon (C, 0, (11Z]l 3p), such that 


|| sup Selle < Ср(50 T IZI). 
0<t<T 
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For the detail of the definition and theorem above, see pp. 45—47 in Bichteler, 
Gravereaux and Jacod [5]. 

Applying Theorem 1 to the system of stochastic differential equations consisting 
of А! (і = 1,...,4,1= 1,...,k,0 < t < T) as well as any products of them, we 
obtain the following lemma. 


Lemma 1 Each coefficient in the expansion, Ai (i =1,...,d,1=1,...,k,0 < 
t < T) has all finite moments. 


(Proof) We consider the system of stochastic differential equations (SDEs) for 
Al,..., Ad, ALAL, ..., ATAS, Al, ..., Ай... Then, the coefficients of the SDEs 
are represented by the derivatives at є = 0 of Vo (X © ,€) and V(X (9), which аге 
bounded in [0, Т]. Moreover, it is easily shown that the coefficients of the equation 
are graded and satisfy the conditions in Theorem 1. Hence each coefficient in the 
expansion, Аі, has all finite moments. 


Next, let normalize g(X С ) ) to 


g(X) — gor 
€ 


Go = 
for є € (0, 1]. Then, we have 


GO ~ gir + egor + 


in р. 
Next, for h € H, where H denotes the Cameron-Martin subspace of the r- 
dimensional Wiener space, the H derivative of G is expressed as 


d d T 
1 j E : 
DiG =- У ag XP DA XL" = У 400) / [f r9)! v a ys at, 
| 0 
ї=1 i=l 

where Y? is the Ҝ & R¢-valued stochastic process which is the solution to the 
stochastic differential equation: 

m 

ау = у(х, 9Y P dt + єў ovi (xv dwin Y? = la, 
1=1 


In fact, Y, = ү. Неге, дҮ!(ї = 0), 1,..., т) denotes the d x d matrix whose 
(j, k)-element is O Vi. (ôk = a) 


Moreover, with a notation 9o that is defined by 


009 = (agat) [virt vact?]. 
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е 
where (osx) = (81g (X), m 949 (X P), the Malliavin (co)variance of 
G is given by 


T 
Og = a 99 (993 at. (8) 
0 
Moreover, let 
à ^ T 
f, :— = (одох?) [клу 


and make the following assumption: 
T ^ ^ 
(Assumption 1) Xr = f V, Ý, dt > 0. 
0 


Note that gır follows a normal distribution with variance X7, and the density function 
of gır denoted by f, (x) is given as 


fis) ( no) 
Jar? = VJ 27 UT exp 2XT 


where 


T T 
c :- (agx) n Y Y 0, Vo (XÓ9 , odt. (9) 


Since Ут is the variance of the random variable gır, which follows a normal dis- 
tribution, (Assumption 1) means the condition that the distribution of gır does not 
degenerate. In application, as it is easy to check this condition in most cases, it plays 
an important role for practical purposes. 

Next, let us briefly introduce a truncated version of the Watanabe theory [111] 
based on Yoshida [118, 119]. Under (Assumption 1), осо is uniformly non- 
degenerate for {In < 1}; that is, it can be shown that there exists a positive 
real number со > 0 such that for any c > co and p > 1, 


sup E[1 


o ay loe 771 < оо, (10) 
€€(0,1] 


{с 


where © = ИН IVO — .lar. 

Let S be the real Schwartz space of rapidly decreasing C??-functions on R and 5” 
be its dual space. Then, for b: Rt> К, Ф є Sa composite function ф(°)Ф o 
GO = YEGO) is well-defined as an element of D7% = Us zo e paoo Dp,s- 
Here, (x), x є R denotes a smooth function 0 < w(x) < 1, defined as w(x) = 1 
for |x| < 1/2 and (x) = 0 for |x| > 1. Here, a Banach space Ю, ;, 5 < 0 is the 
dual space of D,, ., (R)(q = p/(p — 1). 
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Moreover, the coupling with the function 1 is well-defined, which is called as 
generalized expectation and is written as E[/ (п )Фо С 9]. Further, v(?)o oG(? 
can be expanded іп D ?*. 

In addition, it can be shown that {7} (ш); e € (0, 1]) с D®, n? (ш) is O(1) in 
D^? as є | 0, and that for any ао > 0 there exist positive constants а;, i = 1,2,3 


such that P(tnC| > ao}) < a, exp(—aoec ^). Hence, for апу k = 1,2,..., we 
have , 

«Pn?» 5) 

lim ———— ——— < oo 

є}0 ek 


This means that the probability of the events truncated by mox ) is smaller than any 
polynomial orders of e. Then, in the expansion of v (n? )9 o G, the coefficients 
expressed as generalized Wiener functionals belonging to D-^ can be written by 
applying Taylor's formula to Ф (gor + egir + €? gar +: · - ). Therefore, the asymptotic 
expansion of the expectation E[d (G?)] can be obtained relatively easily. For the 
details of Watanabe theory and its truncated version above, please consult Watanabe 
[111] and Yoshida [118, 119]. For its application to valuation problems in finance, 
please also see [50]. 

In particular, if we take the delta function at y € R, ôy as Ф, that is P (x) = ó, (x), 
we obtain an asymptotic expansion of the density function of С (9. Moreover, because 
functions such as (x) = тах{х, 0} that is measurable but not smooth, frequently 
appear in finance, the framework mentioned above is necessary for the asymptotic 
expansion. 

For instance, when we take max(x, 0}, min(x, 0} or ó, (x) as (x) for a useful 
application in finance, the expectation of ®(G“) is expanded as follows: for N = 
OF 1, 234255 


N (n) 


E[o(G)] = pii —E oir) NES «pr | | +o”) 


п=0 k j=l 
N (n) 


= [eq] oct 


n=0 kin 
N (n) 
2 a a ФО ED [gir = x] fyr dx +o") 
n=0 kin 
N (n) 
-ZePa cce 
п=0 En 
m > 
x 05 [ex gir = ER dx + o(c") 


(11) 
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where ФО”) (gir) = TRW 


(n) 
X = рь ad 


, 
X=91T kin 


m 


xn .— Leer. (12) 
j=l 


In order to compute the asymptotic expansion (11), we need to evaluate the condi- 
tional expectations of the form: 


E | Xo 


giT =x] , 


where X' is represented by a product of multiple Wiener-Itó integrals. 

Inthe preceding works on application of the asymptotic expansion, the conditional 
expectations in (11) were directly computed with some formulas including multi- 
dimensional ones given for example, in [85, 86]. Recently, while the formulas up to 
the third order are given in the works, [95] has developed a high-order computation 
scheme for the conditional expectations by using the fact that each of these (A * } jks 


{9пт}п and also (X km }z, can be decomposed into a finite sum of iterated multiple 
Wiener-Itó integrals by ‘applications of the Itó's formula with certain properties of 
iterated multiple Wiener-Itó integrals. (Please see Sect. 4 of [95] for the detail.) 

On the other hand, as shown in the next section, we can develop an alternative 
method which does not evaluate the conditional expectations directly. 


3 Computational Scheme 


This section follows [96] to introduce a computational scheme for the asymptotic 
expansion, which is an alternative to the direct calculation method for the conditional 
expectations given in [95]. 


3.1 Preparation 


To compute the conditional expectations on the right hand side of (11), we use the 
following lemma which can be derived from a property of Hermite polynomials and 
leads us to compute the unconditional expectations instead of the conditional ones. 


Lemma 2 Let (Q, F, P) be a probability space. Suppose that X € L^(Q, P) and Z 
is a random variable with Gaussian distribution with mean О and variance У. Then, 
the conditional expectation Е[Х|2 = x] has the following expansion in L?(R, и) 
where p is the Gaussian measure on R with mean 0 and variance У: 
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E[X|Z =x] = У, T. HG E) (13) 


where Hj (x; У) is the Hermite polynomial of degree n which is defined as 
2 а" 2 
Н„(х; X) = (E) e PY е 2 
dx" 


and the coefficients аһ are given by 


119 
an = hi” ae 


[cometa] ‚ G 2 М—1). (14) 
©=0 


(Proof) Since the system of Hermite polynomials { Н, (х; X)) is an orthogonal 
basis of L?(R, ш), and E[X|Z = x] € L?(R, ш), we have the following unique 
expansion of E[X|Z — x] in L?(R, и): 

E. d 
E[X|Z = х] = > — H, (x: X). 


yn 
п=0 


Since we have another Taylor expansion 
5 oo 
ix _ - 5x HaQ X), 
е^ =e 2 > D Ge», 
п=0 


Шеп, 


2 


2 . E B 
es *E[etZ x] = Ы! e**E[X|Z = x]u(dx) 
R 


e Ha: E) aw, Ha; Duds) 
= usur. 
п=0 


; : g " ; 
Comparing to the coefficients of the Taylor series of e ? "E[e/*7 X] around 0 with 
respect to ©, we see that a, can be written as (14). 


Next, we write Ӯ, = (OXO) Yry, VOR”) as V(x». Then, we define 
ĝi = (9i; t € Rt} and Z® = (249); t € R*) as the stochastic processes 


t 
TE n V(X), 
0 
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and 


M E 
29 = ехр (icis + z% , 


respectively, where X; :— Jo Y (xy (xyray. 
Then, from Lemma 2, the conditional expectations appearing on the right hand 
side of the Eq. (11) is expressed as 


E(X™ gir = x] = EIX""|dir = x — C] 


oo 
a 
= >, Hix — C; Xr) (15) 
5 
1-0 T 
where ; 
й 10 l kim 7E) 
а" rcm [ExzP. (16) 
POCHE OI Ls E 


Here it is noted that with this expression we now need to compute unconditional 


expectations E [x hz P] instead of the conditional expectations. 


3.2 Asymptotic Expansion of Density Function 


In this subsection, we explain a new computational method through deriving a general 
formula for the expansion (11) with an arbitrary specification of its order N. In 
particular, we show that the coefficients in the expansion are obtained through a 
system of ordinary differential equations that is solved easily. 


First, we define G ©) for Їз € Ly.g and dg € (1,..., d]? (n > B= 1) as 
B | | 


4 B 
а; а; 
жа; =Е | [412 |. (17) 
ў u 
and for n = О as 
пе 9 - E[z?]. (18) 


Then, by using (6) we write the unconditional expectations E[ X km 25 in (16) 
in terms of 7) as follows: 
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PT € 
E[X ZO |=E Leer 2 


j=l 
m (kj +1) j 

= Е l pfi g(XO)A APA ge 
П> s 4 


(k14-1) (km+1) m 


1 (0) d} &- dr 
=>: У П 5:95, 9 (Xp) s. oi (7:8) 
Dod тй Nr 05 
PY PY "m m 
(19) 


where 
dj 9 d), :— (dj... dg, dj, ... do, 


Ui, el UT. Ind 


d; І 
So, we have to calculate T; "(T; €) to evaluate the asymptotic expansion (11). 
B 


In the following, we derive a system of ODEs satisfied by these 159). Before 

B 
showing a general result, we first derive the ODEs for a few leading-low-order terms 
explicitly to give a better intuition of a key idea y our method. Particularly, let us 
consider the evaluation of (Т; 6) = Е Leas | which appears in the є-огӣег. 
Here, for simplicity, we assume that Vo does not depend on e, and write Vo(x, є) as 


Vo(x). In this case, we first note that the SDEs of Al, and АЛ, (j =1,...,d) are 
given as follows: 


dAl, = Sai ð; vi (xat + усх aW, (20) 
j=l 
d 
dA}, = 40, vi KP) +5 x ALAN Op д, VÀ XO) | at 
j '=1 2) k=l 
d 1 
+}, AL, OVI (Xd Wy. (21) 
j=) 


Also, the SDE of z® is expressed as: 


dZ = GV xOz aw,. (22) 
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Then, applying Itó's formula to ALZ ©, we have 
4041,20) = Ad dz® + zaal, + а(А1, 29), 


d d 

А i! су, 0 ; 0 ‚‚ Е 0 

= LS У АЛ, $axa, VI xy + У АЈ, Zh apvd KP) 
j=l j=l 


d 
1 "gk! У(© iy) 
+5 >) ALASZP Oj Oe VÀ] (XP) [dt 


1 


а 
; j » (0 j’ iey O 
+ {GOA zP V(X) + У AL zE pvi o Law. 
jai 


Since the last term is a martingale, taking expectation on both sides, we have the 
following ordinary differential equation for Noy: 


d E. a А 
п: 9 = GE) У my OV ОХИ )др vH Or" 
j=l 


d d 
of . 1 of А H 
5 Lf Jey ©) > JK p. Jey) 
T 4 ay £)Oy Vg (X; ) + 2 £ ШШ ©)ду Oe Vg (X, ). 
J= JGK = 


Here, ny j = 1,...,d) appearing in the right hand side of the above ODE are 
evaluated in the similar manner: 


4L ZÉ) = ajaz + zaai, + diaj, 2, 


d 
= [GOZO eov ord? e Y А ОН аА | ar 
=1 


[абл 090 000) + ziv od? аж, 


һепсе, ме һауе 


а 
а i : ^ " m . 
TOG O = GO 9 OP vA Or + D no 87 Vg ОХ). 
j=l 


пі m and other higher-order terms can be evaluated in the same way. The key obser- 
vation is that each ODE does not involve any higher-order terms, and only lower- or 
the same order-terms appear in the right hand side of the ODE. So, one can easily 
solve (analytically or numerically) the system of ODEs and evaluate the expectations. 
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The following proposition provides a way to calculate general п Т; ©) asa 
8 


solution to the system of the ordinary differential equations: 


Proposition 1 For т? (t; ©) defined in (17), the following system of ordinary differ- 
B 


ential equations is satisfied: 


8 
d dg R ER 1 dsj Ik 17d (0) 
“| wo} = 2 "IDA а: o] [ovi od 0] 


К= 
+ >> y ol. amet. p 
2 (lk — 0)! у! | Gem, 
k=1 l=1 tity d, 


x bi aic tys m. o] 


B (1-1 0-1) 


ajtem) Gs , 3 vd v 
sb > 2. 2 zd lei denos ol DU e ) 


x |а yon am] 


в (=) 
(dog, 
+0 PE EIL "yen, (5 ol 
ydy 
x |o үу ad] V(x) (23) 


where Py > is defined in (4), and 
т 
lsr = (l1, ..., loas lepus... lg) 
la] = (l, -s las leas ssl ан, .--,4g), Lak «nz 


lg & ту fem (h, ..., Ig, mi, ..., m4) 


forlg = (h, ..., lg) and my = (m,..., my). 


(Proof) We firstly apply Itó's formula to (II 4 Aj) by using (3) to obtain the 


following: 


360 A. Takahashi 


B з |в B ) 
а; а; а; а; а, Ds 
d П 2; =} Ap, | dA + > 1 II Aj), | (Ag Tw 
j=1 k=1 | j=1 k,m=1 j=l 
J#k k«m \ j#k,m 
B B 7 1 
j e vr dk 0 
=> Пл, ттд Vo (Xr ) O)dt 
k=l | j=! К" 
jzk 
B B а lk (D 1 y а 
j i! 
t 25 П, T DT m — —D!'wy AH Am ji 
k=1 | j=1 Ila, d, j=l 
jzk 


x0 OF уох) Ode 


a | ав 
| 8 | Б 
+2 ЦА» = Па е9) VEX aw, 
k=1 | j= 2 3 t | 
js “е? 


B k 2 (= 1) (s — 1) 1 
34 По. d. 
Y 


RIA ЕР 


" 
ЖШ n АЕ y P П д ji | 95 v^ acta. 
= j=l 

(24) 


Note also bas 424% = = (©)ў(х@)у® dW,. Then, applying Itó's formula again to 
(п; =i A ea S. and take expectations on both sides to obtain the result. 


Remark 2 Due to dm (t; €) = EIZ/?] = 1, and the hierarchical structure of the 


ODEs with respect to n = bn 


jc lj, one can easily solve these ODEs successively 


from lower-order terms to higher-order terms with initial conditions т (0; ) 20 
в 
for (/3, dg) # (0, V). 
Remark 3 Further, due to the structure of the system of the differential equations, 
it is easily shown by induction that each me (ts €) is expressed as a polynomial of 
B 


degree n — Xu 1; with respect to (i£). Then, we can also show that ЕХ z$] 
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is a polynomial of degree (п + m) with respect (о (i£), and thus а" =0(1»>п-+т) 
for km € Ln.m. This ensures a convergence of the infinite sum in (15). 


Then, from Lemma 2 and (11), we have the following expression of E[ 6 (G(9)]: 


E[6(G'^)] = e = | $ (x)(— D" 

п=0 kin m! R dx" 
n+m kin 

е > Yi Нбх — С; Ут) fyr (x) dx + o(e") 
1-0 T 

N (n) 1 
п 

= E. 

Yel vo 

п=0 kin 
n+m aim 

x NÈ diss Henr — C E fi | dx +(e") 
1-0 T 


Here we have used the well-known property of the Hermite polynomial: 


m = 


dxm Un =G; Er) far} == (=) Am (X =; Ут) fore ОО. 


In particular, let Ф be the delta function at x є К, бу, we obtain the asymptotic 
expansion of the density of G: 


foo (x) = Elo; (G)] 
(n) 1 n+m а" 


N 
=2,е > 03 sie Нн" — С; Er) four (2 + o(€%). (25) 
n=0 i 1=0 


km 
We summarize the discussion above as the following theorem: 


Theorem 2 Let X/? be the solution to the stochastic differential equation (1). 
Suppose a function g : R? > R is smooth and all of its derivatives have poly- 
nomial growth. Then, the asymptotic expansion of the density function of GO = 


9(Х@)—(Х 


N . . 
= ир to є -order is given by 


fao (x) = Лат ОЎ 


N 3n 
re (> Cnm Hm (x — С; 0 forr (x) + o("), 


n=l m=0 


(26) 
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where 


(27) 


ау 
DEOR vee ) 


1 
ех 
М2тт E ( 2XT 


with 
(VT [7 (0) 
= (дах )) f YrY; 13V (XO, Oat, 
0 


T 
Уг = f Vy x94; > 0, 
0 
V(X) = (9gx T vr v7! v (x. 


Hn (x; X) isthe Hermite polynomial of degree n with parameter У, which is defined as 
d" 
HG; E) = (C Eye PP „ела, a 


and 


(m) (ki+1) (ks +1) 


Cum = = “уз 22 54 > "ET б)! 


ks Dd, бй, 


б 


T IL; g x9 
: Bj! zu 
1 Fa 


жс Mı 76 
[т—8 д, m-—ó llo e 
= е0 


DN "me. (= У). 09 


т? (Т; ©) are obtained as a solution to the following system of ODEs: 
В 


В 


d |а ecl 
z D в; ol -P n, A ©де ур“ (X, 0) 


Eu! 
Bod (I) 

D > 1 84 g; 224 д Ly d (XO, 0) 
k—1 [=1 Lz Dy "d, en, 0 


tity ,d 
B (@—1)(—1) 1 m 2 4 
(dg/k,m)@dy@d5 


+ КҮТ E = 
y!ô! (8/k,m )&m. cm; 


К,т=1 >- > 
k<m Foy dy sis ds 


(t; €) 
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х д1 V^ OT v(x) 
ô 
В (dk—1 
eri a ie oy 224 pix oC ug 

i a 7! Nis 4) Oi 
d э 2 
n (0: £) = O for (їз, do) # (0.0), mg (ts = 1 for (їз. do) = Ø, 0). 

(30) 


Here, we use the following notations: 


l3jk = (li, ..-, loa а 18) 
Igik,n = (D, ..., las lea, «es Ins Inr s.s lg) 1E k«nzB 
lg @ my im (li, ... Ig, mi, ..., My) 


forla = (li, ..., lg) and m, = (mi, ..., m). 


Remark 4 Particularly, in order to calculate the expansion above up to the c?-order, 
we need the Hermite polynomials H, (x; X) up ton = 6, which are given as follows: 


Ho(x; >) = 1, 

Hi(x; У) = х, 

Hy; Я) = х? – У, 

Нз(х; X) = x? — 35x, 

H4(x; X) = x4 — 65x? + 3%, 

Н;(х; У) = х? — 10Xix? + 15X?x, 

Асб X) = xf — 159х* + 4559°х7 — 15%. 


3.3 Remarks on the Asymptotic Expansion 
for Multi-dimensional Density Functions 


We can also apply the conditional expectation formulas for the multi-dimensional 
case in Lemma 1.1’ of [85] and Lemma 2.1 of [86] to derive an asymptotic expansion 
up to the third order of the multi-dimensional density functions. This is particularly 
useful for pricing exotic-type options such as barrier options with discrete monitoring 
(e.g. [83]), and pricing Bermudan-type or approximate American-type derivatives 
(e.g. Nishiba [71 ]). 
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Moreover, we obtain the following result as an extension of Lemma2, which 
easily leads to an asymptotic expansion of a multi-dimensional density function in 
the similar manner as in the one dimensional case in Theorem 2. 


Lemma 3 Let (©, F, P) bea probability space. Suppose that X € L?(Q, P) and 
Z is a d-dimensional random variable with Gaussian distribution with mean 0 (d- 
dimensional zero vector) and variance-covariance matrix У. Then, the conditional 
expectation E[X|Z = x] forx € Rf has the following expansion in L?(R4, и) where 
и is the Gaussian measure on R with mean 0 and variance-covariance matrix à: 


оо 
E[X|Z 2 3] = У aj Нб; X). (31) 
\7i|=0 
where n = (n1, n2, ..., na), |n| = п 4- n2 +-+- + па, n! = ni'n! --- па! and 
n EP" ow. 
quet e [6 zig [/ 92x] = =) | m5 
n ilnl дё |^ = 
&=0 
Нете, au denotes the transpose of È. Н; (X; X) stands for the d-dimensional mul- 
tiple Hermite polynomial of degree |n| with n = (n1, n2, ..., nq): 
Н(Х; X) : : : [x : £] 
п (х; = — ee n[x : X]; 
n = n[x : X] Ox, д Ox, E 
X = (x1, X2, .... Xd) (33) 
where 


3 1 1 
i ELT aos ео |5 


rz. (34) 


(Proof) Basically, we can make a similar discussion as in the proof of Lemma 2. 
Indeed we first note that the system of the following Hermite polynomials is a com- 
plete biorthogonal system in L?(R4, и): 


ЕИ E) i = posco nda); ni = 0, 1,2,..., (i = 1,2,..., d)}, 
{He : E): A= (n,n... ng ni = 0,1,2,...,0=1,2,...,0)}, 


where H; (X : X) is given by (33) and H;(X : X) is defined as follows: 


TEE a US. ор. 
iG ЕЮ = тегул ( all A ( jum dii 
1 


y 


Il 
~ 
= 
5e 
i 
Ба 
a 
— 

4 

ll 

IM 
| 
al 
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Thus, we have the following expansion of E[X |7 = X] in L? (R$, и): 


оо 
E[X|Z =X] = У ан: E). 
|n|=0 
On the other hand, we know the relation: 
oo iw . PE 
Y OP ig zy = ef eB, (36) 


J 
|j|=0 


where (£)! = (i€1)/! (£2)? --- (i€a)/4. Hence, 


oo Ns ? 

i£'x —A£T x£ (ig)/ Bh 

e =e 25 2 ү H; : X). 
|j|=0 


It is also well known that 


тИ 5. T" - m! (if m =n), 
[a Hg (x : 3) Hg(x : X)n[x : X]dx = | 0 Gf m + à). (37) 


Therefore, 


eiT Eip CE x] ED P; DE bxiz]] 


8 


х | У ан: E) рах) (38) 
|n|=0 


ay approve. 489) 


[910 


and making п = (пі, . .., па) order differentiation of both sides in the equation 
above with respect to £ = (£1,..., £4) at = 0, we obtain (32) and hence the result, 
(31)-(34). 
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3.4 Expansion of Option Prices 


Now, we apply the approximate density function in Theorem 2 obtained by the 
asymptotic expansion technique to option pricing. 

In particular, we consider a plain vanilla option on the underlying asset price 
process (g(X (9 ))refo,7], Where (X (95 етот is the solution to the stochastic differ- 
ential equation expressed as the Eq. (1). As an example, we obtain an approximation 
of a call option price as follows. 


Theorem 3 Ал asymptotic expansion up to the e+» -order of a call option price 
at time О with maturity T and strike price К where К = gx — ey for arbitrary 
y € Ris given as follows: 


C(K, T) = cP(0, T) й (° i =) +CN (=) 4+yN (222) (40) 


JE VE d 
+e p, T)Cno 22 (==) ты =] 


n=1 


N 
p yer) VEE) 
+) е P(0, T)C,4 [r v (E Xr yn Te 


n=1 


N 
+ >i et PO, T) 
п=1 


Зп 


> Cnm E Ут Hm-ı (C +C); Хт) n (° = =) 


Ут 


т=2 


+ Lf Hm (O +С): Хт) п (222) 


У +1 PO, T) Cag ( tL 
1253€ (0, T) Cro Wo 


n=1 


N 3n 
у+С 
+y Le PO, T) >) Com V ErHy-i(-( +С); Er) n (=) + o(e™+D), 
Т 


п=1 т=1 
Here, Cnm is given by (29), and Н, (х; У) is the Hermite polynomial of degree n with 


parameter У, which is defined as 


Е а" 2 
Hy (x; E) = (Byte 29.7 „чепи, 
X 


C and Ут are given respectively by 


T T 
c = (0s) n YrY; 0, Vo(X(9 , O)dt 
0 
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and 
T 04 5 0 
Xs =| Vx OY ar, 
0 

where 

7 0 0 = 0 

VX) = OPY Yr, V(X). 
Also, P(0, T) denotes the price at time 0 of a zero coupon bond with maturity T. 


N(x) stands for the standard normal distribution function, and its density function 
"vol — Dd 200, 
is given by n(x) — Jat А 


(Proof) We firstly note that the call price is expanded as follows: 


C(K, T) = P(0, T)E[max(g( X?) — K,0)] 


(е) = (0) (0) _ 
[eet (unn 


= eP(0, T)E [max [co 4 y, of | 


= єР(0, г | (x + y) fgo м GOdx + о(Ү+Р), (41) 
>y 


Here, fg y is the asymptotic expansion of the density of СӘ up to e" -order, 
which is given by the first two terms on the right hand side of (26) in Theorem 2: 


N 3n 
foo N GQ) = Лат ОЎ ТЕ Уе (> Cnm Hm (x — С; =n) for ОЭ, (42) 


n=1 m=0 


where 


far = = ©) 
SIT = * 


1 
ex 
„лт »( aus 


Next, we note the well-known properties of the Hermite polynomials: 


4 Ay (x; У) = n Ha (x; У) (43) 
ах 

нау (5) Hn+m(x; E)n(x; E) 

ах" У 


Hj44(x; X) = x Ay (x; X) — XYnH, 4 (x; У), 


ENS 


e 2x, 
VZT 


where n(x; 3) = 
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Then, we can obtain the following expressions for the Integrals appearing on the 
right hand side of (41): 


oo C 
| Soir x)dx = N (=) , m 
: +C y+C 
А а. { TEN =) ; 
| = ч ; Te) 
у" © = C; Èr) for @)dx = V Er Hm—ı (CO +C); Er) n ( m) m > 1, 
оо И 
/, x Hy, (x — С; Er) far dx = -V Ery Ни—1(—(у +С); Er) n (x) 


T 2 , , zw 
ý МУТ 


Remark 5 In practical applications, usually the underlying model is given as a non- 
perturbed form: 


dX] = Vd (yat + Vi(X)dW, (j =1,...,d) (45) 
Xo = x0 € R’. 


Then, in order to apply the asymptotic expansion method, we may rewrite the model 
for instance, as 


ах = vi (X)dt + ev/(x(9)aw, G = 1,...,а) (46) 


x = x9 € R, 


where by rescaling vi (x) we set V (x) so that vi (x) = €V/ (x) for some € є (0, 1]. 
Consequently, an approximate call price under the original model (45) is obtained 
by (40) without о(є\+!). 


3.5 Application to Computation of Greeks 


We already have a so called closed form approximate formula (40) for the option 
price, and hence are able to obtain approximations of its Greeks (that is, sensitivities 
to the changes in parameters in a model) as closed forms as well (or at least with 
easy numerical method such as the difference quotient method with the approximate 
option pricing formula). 

For instance, [68] implements direct differentiations of the approximate formulas 
for option values under a time-homogeneous general local volatility model, and 


Asymptotic Expansion Approach in Finance 369 


obtains closed form approximate formulas for the Deltas and Vegas. Moreover, [68] 
applies the similar technique to computing the Deltas and Vegas for average options 
with continuous monitoring, and gets their closed form approximate formulas as well. 
They also confirms the validity ofthe approximations through numerical experiments 
in the CEV model. 

By deriving asymptotic expansions of characteristic functions of option values, 
[93, 94] propose a new expansion scheme for pricing options on long-term currencies 
under a Libor market model (LMM) and a general diffusion stochastic volatility 
model with jump of spot exchange rates. Furthermore, applying the approximate 
formulas, they provide analytical (closed form) approximations for the Deltas and 
Gammas of the options. Please see [93, 94] for the detail. 

Alternatively, for a parameter 0, the sensitivity ofa call price C(K, Т) with respect 
to the change in 0 is expressed as follows: 


aC. T) = P(0, T)E[max{g(X}) — K, 0}] 
= E (РО, Т)Е [max [69 + у, 0}]) 
= P (eP(0, т) Е [max [c des of | 


+єР(0, T) (SE [max [c + у, o})) 


Е (= 2) СлЕ(К, T) 
B 90 eP(0, T) 


ac® д 
+єР(0, Т)Е Є + x) КИ] , (47) 


where СдЕ(К, Т) stands for the approximate call price with strike К and maturity 
Т, which is obtained by the asymptotic expansion. 

Then, we are able to obtain an approximation of the sensitivity by a direct appli- 
cation of the asymptotic expansion to the above equation, particularly, the second 
term in the last equation. For example, under one dimensional diffusion setting, that 
is a general time homogeneous local volatility model, [66] successfully applies the 
expansion technique to computation of the Deltas and the Vegas with numerical 
experiments. 

More generally, we note that the similar method as in option pricing in the previous 
subsection can be applied in Greeks, since we can take c S' for E[®(G)] in 
(11) and apply the integration-by-parts method in Malliavin calculus. Recently, [103] 
takes this approach and derives asymptotic expansions of Greeks around the Black- 
Scholes model in stochastic volatility environment, and develop a unified method 
for precise estimates of the expansion errors. Particularly, they make use of the so 
called Kusuoka-Stroock functions introduced by Kusuoka [52], which is a powerful 
tool to clarify the order of a Wiener functional with respect to the time parameter t 
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in a unified manner. Then, they estimate the error bounds for the Malliavin weights 
of both the coefficient and the residual terms in the expansions. 


3.6 Approximations of Asset Values Under Diffusion Processes 


The framework of the asymptotic expansion can be applied not only to the simple 
cases mentioned above, but also to evaluation of much broader range of asset and 
security values. In particular, there are many cases where the asymptotic expansion 
can be applied to approximate their values when the underlying asset prices of finan- 
cial securities, cash flows and interest rates are expressed as some functions of a 
random vector X that follows a diffusion process. The method is almost the same 
as the one illustrated above and hence it is omitted. In this subsection, we only review 
how to represent the values of financial assets. 

First, just as in the previous subsections, we consider a d-dimensional diffusion 
process X defined as the strong solution to the stochastic differential equation (1). 
As an example, the present value V of a financial asset which generates a cash flow 
at the maturity date T' is represented as 


y « E[e- h e urgay], (48) 


where g denotes the underlying asset price and F is the cash flow which characterizes 
the asset to be evaluated. Note that the underlying asset price g follows a diffusion 
process, whose drift term (the coefficient of the dt term) 15 R1(X (9) g — D(X (9) 
under an equivalent martingale measure. Moreover, Ау at time t € [0, Т] is repre- 
sented as 


Л 
(19) = 019) + У sif?) 
j=l 


where r denotes the risk-free interest rate and 51;, j = 1,..., Л stand for various 
spreads (the differences from the risk-free rate) such as credit spreads and liquidity 
spreads. Suppose also that those are expressed as functions of the variable Х(9. Fur- 
ther, D(X ( 9) denotes a payoff generated by the underlying asset such as a dividend or 
an interest rate and is also represented as a function of the variable X (9. Meanwhile, 
the discount rate at time ¢ that is, R2 (X © ) of the target asset F to be evaluated is 
also expressed as 


Jy 
Ry?) = (Xp?) + У s 19), 
j=1 


where 52у, j = 1,..., J2 are various spreads related to the objective asset or security. 
We again assume that those are expressed as some functions of the variable X(9. 
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As an example, let F = 1 in (48) for a zero-coupon bond with the face value 
1 and the maturity date Т. Also, let V; denote the price of the zero-coupon bond 
with the maturity 7;. Then, V, the value of a coupon bond with the maturity Ту 
and coupon (and principal) payments c; at T;i(j = 1,..., №, Ti < --- < Ty) is 
represented by the equation V = 55; ci Vi. Moreover, the present value of a call 
option on the coupon bond with the option maturity T (< Тү) can be evaluated if we 
set F(x) = (x — K)* and g(X()) = SN, cigi (X?) in the equation (48), where 
a, i = 1,..., N are given by 


Tj (o 
gi (XO) =E EL ROG, xd | 


Finally, we briefly review applications of the asymptotic expansion technique to 
numerical problems in finance, which can not be introduced in the present note due 
to the limitation of the space. 

Takahashi and Yoshida [106] applies an asymptotic expansion to a dynamic invest- 
ment problem with utility maximization for the asset at the end of the investment 
period, and derives an approximation formula for evaluating the optimal portfolio. 
Although the optimal portfolio has been numerically evaluated as a function of deriv- 
atives of the solution to some Bellman equation except for special cases, it is a hard 
task to implement it when the number of assets is large. Takahashi and Yoshida 
[106] provides its approximation based on the representation which Ocone-Karatzas 
[74] derives by using the so called Clark-Ocone formula. Moreover, [45] applies this 
method to a dynamic bond portfolio problem. 

In evaluation of the expectation of a Wiener functional based on Monte Carlo 
simulations, [107] proposes a new estimator with a control variate which has its 
expectation explicitly obtained by an asymptotic expansion, and has a high correla- 
tion with the target Wiener functional. The convergence of the simulation based on 
this estimator becomes much faster and the approximation error with the asymptotic 
expansion up to a low order such as the first or second order is decreased. As for the 
extension of this method, please see [51, 88, 99]. 

For pricing American options, [89] extends a well-know decomposition formula 
for an American option value by Carr-Jarrow-Myneni [8], and proposes an approxi- 
mation of the value by making use of the approximate density function of the under- 
lying asset, which is obtained by the asymptotic expansion. 

Moreover, because of its generality and unified nature of this approach with ana- 
lytical (so called closed from) formulas, the asymptotic expansion method has been 
applied to broad class of valuation models which have become popular recently in 
practice. Especially, comparing to other numerical approximation schemes such as 
the Monte Carlo simulations and numerically solving methods for the partial differ- 
ential equations (PDEs), it has an advantage in high dimensional problems. We list 
the following works as examples. 
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Applying the framework described above to default risk models, Muroi [67] 
derives asymptotic expansions for approximations of CDS (credit default swap) 
spreads. 

Shiraya et al. [82] applies the expansion technique to obtain an approximation 
of swaption values under the Libor market model(LMM) of interest rates (Brace, 
Gatarek and Musiela [7], Jamshidian [43]) with local-stochastic volatility models. 

Takahashi and Takehara [90—92] develop asymptotic expansion formulas for pric- 
ing long-term currency options with a Libor market model(LMM) of interest rates 
and diffusion or jump-diffusion stochastic volatility processes of spot exchange rates. 
Moreover, [92] presents a new characteristic-function-based Monte Carlo simulation 
scheme with the asymptotic expansion as a control variate. 

Takahashi and Takehara [96] develops a general computation scheme for a high- 
order expansion method explained in this section, and applies it to ће SABR model 
(Hagan, Kumar, Lesniewski, and Woodward [33]). They derives the expansions of 
the option prices up to the fifth order to show that the higher order expansion improves 
the approximations. 

Takahashi and Takehara [108] and Takahashi et al. [109] also apply this scheme to 
the long-term currency options such as the 10 year maturity one under a Libor market 
model (LMM) of interest rates and stochastic volatility processes of spot exchange 
rates. Again, they confirm that the fourth or the fifth order expansion provides the 
better approximations than the lower order ones. 

Furthermore, we are able to apply the expansion method to pricing the so called 
exotic type options. For instance, [78] derives expansions of average options with 
discrete monitoring under stochastic volatility models in order to obtain approximate 
prices of commodities average options. Moreover, they implement calibration to 
real futures plain-vanilla option prices of the underlying commodities, and evaluate 
average options based on the parameters obtained by the calibration. 

Shiraya et al. [83] develops new approximation formulas for pricing single and 
double barrier options with discrete monitoring under stochastic volatility models. 
In addition, they demonstrate its validity through numerical experiments. 

Shiraya et al. [81] presents a new approximation scheme for pricing continuous 
barrier options in stochastic volatility environment. Particularly, they make use of 
a static hedging scheme and the fifth order expansions of the vanilla options to 
obtain accurate approximate prices. Further, they derives the fifth order expansions 
for pricing average options with continuous monitoring under stochastic volatility 
models to achieve very precise approximations. 

Shiraya and Takahashi [79] develops a general scheme for evaluation of the so 
called multi-asset cross currency options. In particular, they derive the expansions of 
basket option prices with 100 underlying assets (200 state variables with their stochas- 
tic volatilities), and cross currency average/basket options with discrete monitoring 
under stochastic volatility models to obtain accurate approximations. 

Kato et al. [46, 47] develop a new expansion scheme for solutions of Cauchy- 
Dirichlet problems for second order parabolic partial differential equations (PDEs) 
and apply it to pricing down-and-out/up-and-out barrier options with continuous 
monitoring under stochastic volatility models. 
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4 Extension 


This section follows [97] which presents an extension of the general computational 
scheme of the asymptotic expansion described in the previous section. In particular, 
by a change of variable technique and by various ways of setting the perturbation 
parameters in the expansion, we are able to provide the flexibility of setting the bench- 
mark distribution around which the expansion is made, and an automatic way for 
computation up to any order in the expansion. For instance we introduce expansions, 
called the log-normal expansion and the CEV expansion. 


4.1 Change of Variable and Perturbation 


We consider a d-dimensional diffusion process X; = (X ie ews x?) which is the 
solution to the following stochastic differential equation: 


dX] = Vi (Xodt + V/(X)dW, (j = 1,..., 0) (49) 
Xo x9 € в“ 
where W = (W!, ..., W^) is an r-dimensional standard Wiener process; Ww : 
R? > R and V/ : R^ — В“ are smooth functions with bounded derivatives of all 
orders. 


Next, let C : к“ к> в be a C?-function which has the unique inverse function, 
C^, and define X, as X, = C(X;). Then, the dynamics of X is given by 


dX] = Vj (Х)а + Vi(X)dW, ( = 1,...,0), (50) 
Xo = Xo, 
where 


d 
WO = у бус 716» 


j'=1 


d 
1 C A TNCS Fo dj 
rds 1 yy v3 (c^ àyv* (c^! y, 


d 
Vig) = Y Bpi CEV c7! 6). 


j=l 


and Xo = С(хо). ((C~!(x))! denotes the transpose of (C^ (X)).) 
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Next, we introduce a perturbation parameter є є (0, 1] as follows: 
X, к> xg 
Via) e V9 o 
VJ (x) e eV! (3), 
and hence, the dynamics of XO is expressed as 


dF = VOI RO, garkepidx9)w, G21,...,d. 61) 


Then, we are able to apply the technique developed in the previous section to the 
transformed SDE (51). 


4.2 Applications to Option Pricing Under Local-Stochastic 
Volatility Model 


We assume that the underlying process is the unique solution to the following SDE: 


dS; = 0(X,;)h(S;)d W; 

dX] = Vd (XÐdt + Vi (X)dW, (ј =2,...,d) (52) 

So = so ER, Хо= хо є g^. 
where с: R! — R^ , h : R — R, апа W is a r-dimensional Brownian motion. 
Then, we evaluate a call option with strike K and maturity T, whose underlying 
price process is given by S. Under the zero discount interest rate for simplicity, the 
call price Call (K, T) with strike price К and maturity Т is obtained by 

Call(K, T) = E[(Sr — K),]. (53) 

First, for x = (x!, х2,..., x4), let 


C(x) = (C(x), x?, ..., х“), 


where C, : R > Risan invertible C?-function. Then, Š, = Cı (S+), which S follows 
a process of the solution to the following SDE: 


pn 1 ES n = 
48, = у|с(х C, ! Goy 61 (Cy Sat 


+ o(X)h(Ct (S5) CO (C; ,))4W,, 30 = Cio), (54) 
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where СО @) := C16) and CO (x) = S5 C10). 
Next, we introduce a pertmibation parameter є as follows: 


dš? = 19 D lex ya; (9c? Ст GO at 
oe yc c SO aw, 
axl! = vi (XO, odt + Vİ(XOAdW, G =2,...,d), (55) 
where (ce) = є“ and k isa nonnegative integer such as k = 0, 1, 2, .... Note that 


S0, (Socr Sp, 


where So = 5O les 
According to Theorem 2 in the previous section, we have already an asymptotic 

59-59 
Є 


expansion of the density function of С© = up to e" -order, denoted by 


Ўсе м0). 


Therefore, an approximation formula of ће call price is given as follows: 
Call(K, T) = Ет — K),] = E [(сг' (55?) - к) | (56) 
+ 


x à (Сга + 5P) - к) feo y Gods, (57) 


y 


where y = Ci(K) — 509., 
A simple example is the following. Set the local volatility function to be linear: 


dS, = o(X;)Sıd W; 
dX} = Vj (X)dt + V/(X)dW, (j =2,...,d). (58) 


For x = (xl, x?,..., x4), let 
C(x) = (log x!, x?, m ‚ x), 
and set (c) = є* where k is 0, 1 or 2. Then, we have S/? = log S(?, where 
а8® = - окуи + eo(XP)aW,, (59) 
ах) = V (X9, dt + eV/(X(9))aW, (j = 2,..., а). 


This case corresponds to some existing researches. (e.g. [91, 92, 95, 96, 100]) 
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4.3 Examples 


This subsection shows more specific examples in the local-stochastic volatility 
model. 


4.3.1 CEV Model 


The first example is on the well-known CEV (Constant Elasticity of Variance) model 
(Cox [10]): 


dS; = с(5 SQ aW, c and So are positive constants, 6 є [0,1], (60) 
where the term s’ makes the level of с is of the same order for different 8. For 


x > 0, let us take the change of variable function to be C(x) = log(x/So), that is 
x =C1(%) = Sp exp(x). Hence, 5, = log = and we have 


" 1 / га ; га - 
d$ = = 502620 Digi toe D) qw, Sy = 0. (61) 
Next, we introduce a perturbation є є [0, 1], again as follows: 
p f eO / «GO ~ 
asp = -1 gaos; dt + єсе8-10% dW,; So = О. (62) 


where 7(€) = є/ and j is a nonnegative integer. 
Because 


Sr = C7! (8) = Spexp (si?) = Sp exp (69 + 50) : 


an approximation formula of the call price with strike К and maturity Т is given as 
follows: 


Call(K, T) = Ег — K)4] = E |(s exp (69 + 59) Е к) | 


il e(0 
Ri Г (so exp (s + $$) — к) fogw, y(x)dx; (63) 
s 
Е К » 
y = С(К) — $0 = log “= 50. (64) 
0 


Note that f;,,, the first term in the asymptotic expansion of the density fg isa 
normal density and hence, the underlying asset price is expanded around a log-normal 
distribution. Thus, we could call this case a log-normal asymptotic expansion. We 
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also remark that the case of (c) = є® = 1 is harder to be evaluated than the other 
cases, which is essentially due to difficulty in computation of se for (e) = 1. 


e On the Validity of the Asymptotic Expansion for CEV model 
Previous works such as [85, 86, 107] have considered an asymptotic expansion 
of (average and vanilla) option prices based on the following type of a perturbed 
process: For 8 € [1/2, 1), 


45° = «(509 v of aw,; SO = so. (65) 


Although the coefficient function in this model is not smooth at 0, the asymp- 
totic expansion method is still applicable. For instance, we could use a smooth 
modification technique (e.g. [106, 107]). That is, let us take a modified process 
(550) срт of (5 © у,его,т] as follows: 


d5 = eg(S9)aw,. (66) 


Here, g(x) is a smooth modification of g(x) = (x V 0)“ such that g(x) = x? when 
x > a, for some small a; є (0, а) fora = 150 and g(x) = 0 when x < a» for 
some a» € (0, ат). Specifically, we may set g(x) as follows. For t € [0, T], 


g(x) = h(x)x? 
W(x — a2) 
h = 
ca w(x — a2) + (a1 — x)’ 
w(x) = e7! forx > 0, w(x) = 0 for x < 0. 


0<a <a, 


NE 
Suppose that for a R-valued function f, E [esp < ooand E [ О | < 


оо. (e.g. we can take option payoff functions as f in our setting.) Then, we have 
2 1 : ] 
E|| £68) - ETE С [Por]? Әр ) 
1 
x P (is® £ $0j) 2 
It also holds that 


Р (s £ 5) = P ({S* < а for some t € [0, T]]) 


< Zl sup |$; — 59| > а) 
Oxt-T 


С < ај for some t є [0, T]) { sup |S? — S| < 2] 


0<г<Т 
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We can easily see that the second term after the last inequality is 0. The first term 
is smaller than any e" for n = 1,2, . . . by the following lemma of a large deviation 
inequality: 


Lemma 4 Suppose that Z$, t € [0, T | follows a process of the solution to the SDE: 
dZ; = u(Zj)dt + ec (Zp)dW,. 


where u(z) satisfies the Lipschitz and linear growth conditions, and o (z) satisfies 
the linear growth condition. We assume that the unique strong solution exists. Then, 
there exists positive constants сі and сэ independent of є such that 


P({ sup |Z — 20| > c} < avexp(-c2€ 7) (67) 
O<s<T 


for all c > 0. 


The lemma can be proved by slight modification of the Lemma 5.3 in [119] or the 
Lemma 7.1 in [50]. Note also that S€ and $ satisfy the conditions іп the lemma 
above. 
Hence, 


E яс) = £6] = of"), n21,2,.... (68) 


Therefore, the difference between f(S) and f (S9) is negligible in a small 
disturbance asymptotic theory, and hence we could apply an asymptotic expansion 


to E [у ($e instead of E [ f (S'9)]. 
In particular, [107] considered the case that 2 = 1/2 and f(x) = ( i (a xdt— 


K)*, x = S(9, $© (an average call option’s payoff). The similar modification 
could be applied to the asymptotic expansions for transformed processes in this 
section. Please also see [88] for numerical experiments under the smooth and 
bounded modification of this kind for volatility functions in a HJM-type model of 
interest rates. 


4.3.2 SABR Model 


Next, let us consider a stochastic volatility model so called SABR [33] (or A-SABR 
[38]) Model: 


dS; = о‹(8/ 5) ^ )dW]; So > 0, (69) 


do, = № — oj)dt + vord W?; оо > 0 


where 8 € [0,1], 4 > 0,0 > 0, v > 0, and W = (W!, W?) is a two dimensional 
Wiener process with correlation p c [0, 1]. 
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Log-normal Asymptotic Expansion 
Let us take a log-normal asymptotic expansion for the underlying asset price 5, 
that is for х] > 0, set C (x1, x2) = (log(x1/50), x2) and S, = log ©: 
z 1 Р Я Р a 5 
d$, = етти + ое8-08 qw!: $$ = 0 (70) 
do, = № — o;)dt + vord W?; оо > 0. 


Next, we introduce a perturbation є є [0, 1], again as follows: 


ase =— mo с2е208—1)5° qt + єсе(8—108:° qw, So = 0, (71) 
do}? = (Л — of? )dt + evo dW); o® = оу, 


where т}; (c) = e}, i = 1, 2 and j; is a nonnegative integer. 

For instance, typical cases are 72(€) = ©? = 1 with n2(€) = є (an extension of 
the log-normal asymptotic expansion in [95, 100]), or т (e€) = e (an extension of 
[90] to the CEV-type local volatility). 

An approximation formula of the call price with strike К and maturity Т is given 
as follows: 


Calli; T) = Е[(5т — OSE |(s exp (c^ T 59) = к) | 


оо PN 0) 
x 1 (5 ехр (s + 8 ) = к) Ўсе м dx; (72) 
" 
" K 2 
y = С(К) – $0 = log == 50., (13) 
0 


Again, we note that the case of (c) = e? = 1 is harder to be evaluated than the 
other cases, which results from difficulty in computation of 50 for (e) = 1. 
СЕУ Asymptotic Expansion 

Let us take change of variable function C as C (x1, хо) = (C1(x1), x2) for (x1, x2), 
where for x > 0 and б є [0, 1), 


] xl-? * dz 
0 MU 


СГ) = HU — B) 53m, (75) 


That is, 
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Then, as S, = С1(5;), we have 


I UR ilg aW); S 3 0 (76) 
=—— = 3 => > 
Beg tu T RT 


do, = № — odt + vord W?; оу > 0. 


45, = 


Again, we set a perturbed process as follows: 


«o. me B (ә Gout: 26... 1 
dS; = 2 1-8 pu) qoute dW,; So 71-58 (7T) 
d (ө _ 20 (© 2. P = 
o = ™mm(e)A(O — о, )dt + evo, аў; = 00, 
where n; (c) = e}, i = 1, 2 and j; is a nonnegative integer. 
For illustrative purpose, let us set 7; (є) = m (€) = e. That is, 
«o. є B oy los. 66 1 
dS, = 21—38 ) galt te dW, ; So =т—8' (78) 


do? = «0 — o®)dt + n с? = 09. 


In this case, as se = 715 апа c = со for all t € [0, T], the first two terms in 


the asymptotic expansion, 91; = D + б $© follows a Gaussian process: 
| = 
-603 1 


dt + codW]; фо = 1-7 (79) 


4ди = 


Then, by applying /t0’s formula to 


àv = Сг) = Sol - map T, (80) 
and using 
1—8 
~ 1 Ii 
Mt —1 31-8 (81) 
1—8 So 


ди = 0: 


Bos} g Pe 0. А 
Mid n MEN. s|- i451 PP |а + 0557 PaP dwl; Gio = So. (82) 
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Here, because the diffusion coefficient of 01; is given by со cs (91 DP and we may 
think that $ is expanded around д1, we call this case a CEV asymptotic expansion 
(though 01 is not exactly a СЕУ process). 

In particular, when 8 = 1/2, 


4ди = = 


vis 


“0 [- у 508. + so] dt + соу Sog d W}; G10 = So, (83) 


and because 


А So 
фт = FEE (84) 


Фт/ (500% Т /4) follows a non-central x? distribution, around which the original 
underlying asset price $т is expanded. 

Finally, for т; (€) = є/', i = 1, 2 and j; is a nonnegative integer, an approximation 
formula of the call price with strike К and maturity Т is obtained as follows: 


- E [([sa - gms] N 


- B| ( LU- c» (Spy) — к) | 


| 
eene cime] 


Call(K, T) = E[(Sr — K)4] = | (сг' (Sr) - 


[ 
sf ({ Soa - «e» 9) (х + se NA a) K) foo, y 00 dx; 
(85) 


so. 1 (KY sz» 


As numerical examples, [97] examines normal, log-normal and CEV expansions 
up to the third order for approximations of option prices under SABR model, 
which implies that CEV expansion provides the most stable approximations. We 
also observe that CEV expansion becomes more precise with the same level of 
absolute errors across the whole range of 2 along the higher order expansions. 
Thus, we expect a higher order CEV expansion will produce the better and more 
stable approximation than the other expansions, though further investigation seems 
necessary. Please see the original paper [97] for the detail of the numerical exper- 
iment. 


Remark 6 If necessary, applying a similar technique as mentioned in Sect. 4.3.1, 
we could use the asymptotic expansion for a model with smooth (and bounded) 
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modification of the underlying processes. For а concrete example please 
see Remark 3 in [97]. 


5 Improvement Scheme for Asymptotic Expansion 


Although the asymptotic expansion up to the fifth order is known to be sufficiently 
accurate for option pricing (e.g. [81, 95, 96, 108, 109], one of the main criticisms 
against the method would be that the approximate density function admits negative 
values typically at its tails that is, some region of the deep Out-of- The-Money (OTM), 
which could create an arbitrage opportunity in option trading. Also, even if the domain 
of a true density is restricted to be positive, the domain of its approximation may 
include negative values unless an appropriate boundary condition is assigned. To 
overcome the problems, we briefly introduce two recent researches related to the 
present asymptotic expansion approach. 


5.1 New Improvement Scheme for Approximation Methods 
of Probability Density Functions 


Takahashi and Tsuzuki [98] develops a new scheme for improving density approx- 
imation methods, which also provides precise approximations of option values. 
Specifically, the scheme is inspired by the idea in the Hilbert space projection theo- 
rem, and so called “Dykstra’s cyclic projections algorithm" is applied for its imple- 
mentation. (Please consult Deutsch [14] for the detail of the algorithm.) We also 
remark that the scheme can be easily implemented in practice, where we need only 
market data used for usual calibration such as option prices with strikes. 

Furthermore, numerical experiments for vanilla option pricing under SABR model 
demonstrate the validity of the scheme. In fact, in terms of approximation accuracies 
this scheme improves the third and fifth order asymptotic expansions preserving 
the required conditions such as nonnegative densities under an appropriate forward 
measure. 

We finally remark that the scheme is general and flexible enough to include a set 
of conditions and information as one would like to put on an approximate density, 
and it can be applied to approximation methods other than the asymptotic expansion 
method. For example, a number of researches have been going on in order to extend 
SABR model with fixing the problem of the negative densities in the method of 
[33]. (For instance, see Doust [15].) We note that the scheme is also a candidate for 
handling this issue. Also, the estimate of the absorption probability based on Monte 
Carlo simulations as in [15] can be consistently incorporated in the scheme. 
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5.2 A Weak Approximation with Asymptotic Expansion 
and Multidimensional Malliavin Weights 


Takahashi and Yamada [105] develops a new weak approximation scheme for 
expectations of functions of the solutions to SDEs. In particular, the scheme con- 
nects approximate operators constructed based on the asymptotic expansion. More 
concretely, a diffusion semigroup is defined as the expectation of an appropriate 
function of the solution to a certain SDE: for example, P/ f(x) = E[f Ce] 
with the solution X/"° of a SDE with perturbation parameter є and a function f. 
Then, we approximate Pf by an operator Ог" which is constructed based on the 
asymptotic expansion up to a certain order т. Thus, given a partition of [0, Т], 
т = ((to,tj,..., 19) :0 = tọ < tj < --- < tn = Т}, we are able to approxi- 
mate Ру f (x) by connecting the expansion-based approximations with the use of 
multi-dimensional Malliavin weights sequentially: that is, roughly speaking, with 
Sp— бр = tg К = l,...,n, 


PÉTQ E QE a 


The present research justifies this idea by applying Malliavin calculus, particularly, 
theories developed by Watanabe [111] and Kusuoka [52—54]. In computation, in order 
to evaluate the Malliavin weights, the paper makes use of conditional expectation 
formulas for multi-dimensional asymptotic expansions in [86]. 

Moreover, the paper shows through numerical examples for option pricing under 
local and stochastic volatility models that very few partition such as n — 2 is mostly 
enough to substantially improve the errors at deep OTMs of expansions with the first 
or second order (m — 1, 2). 


6 Asymptotic Expansion in an Instantaneous 
Forward Rates Model 


Among main stochastic models in finance, there exist models in which the stochastic 
processes of the underlying variables do not belong to the class of diffusion processes. 
This section illustrates an instantaneous forward rates model as a typical example. 


6.1 Asymptotic Expansion for General Wiener Functionals 


Watanabe [111] derives an asymptotic expansion for general Wiener functionals. As 
an example of the Watanabe's expansion, [100] shows the following result: 


Theorem 4 Let us consider а family of smooth Wiener functionals F* = (Е{,..., 
Fy), Ff € D” (i = l,...,n) such that Ff has an asymptotic expansion іп D?*. 
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Moreover, F* satisfies the uniformly non-degenerate condition: 


lim sup || (det gp) | lr» < оо, for all p < оо, (87) 
є}0 


where o fe stands for the Malliavin covariance matrix of Е“. Then, for a Schwartz 
distribution T € S'(R"), we have an asymptotic expansion in R: 


N 
E(T (F5) — n rop" code + d | T(X)E 
R” j=l R" 


(j) k 
Уно (Tp) =x | p dx [| = 00+), 
k 1=1 
(88) 
Equivalently, 
E[T(F – | nu 
м Фф 
+} е хе» f TŒ) a 
j=l k E 
К 0 
x | [n БОЛЕ 1 pl «| as} = (ety 
1=1 
(89) 
where E == h 2i B es =, k € N (i = 1,...,n), а denotes a multi-index, 
a® = (а, ..., ag) and 


= E Ek 


k=1 Bi mJ BRL a efl, n} 


p^ (x) stands for the density function of F? . The Malliavin weight Н „о is recursively 
defined as follows: 


Hao (F, G) = Hiap (F, Haa- (F, G)), (90) 


where 


Ho (F, G) = (Уор к) (91) 
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Here, Е; € Р®, G є D”, D* ($7 у СҮРЕ) is the divergence of У? у СҮ 
DF;, DF; is the Malliavin derivative of F;, апа ү? = (5), __ denotes the 
<i, j<n 


inverse matrix of the Malliavin covariance matrix of F. Moreover, we use the notation 
/ Т (x)g(x)dx for T € S'(R") and g € S(R") meaning that < (Т, g)s. (See the 
Sect. 2 of [100] for the details of those definitions.) 


Remark 7 'The asymptotic expansion formula (89) is the formula developed by 
Watanabe [111]. Hence, this theorem shows the expansion (88) based on push down 
(conditional expectation) of Malliavin weights (divergences) is equivalent to the 
Watanabe's formula. 


(Proof) We use a as an abbreviation of a in the proof, and the notation 


(5 ) pr iine is defined as follows: 


(Т, EE] s T, EP] pP s. 


pF 0 (х)ах = 

Under the uniformly non-degenerate condition of Е є D??(R"), the lifting up 
of T € S'(R?) that is, (E Bos has the asymptotic expansion in distributions on 
the Wiener space 07°, that is for N € №, there exists s € N such that 


N (G) 
(ЕК УТ — {T o F° + 2, 2,047) о "р БОА 
р, 5 
= Ot) єє (0,1, q < oo. (92) 


Then, there exists an asymptotic expansion of ((E^ )*T, 1)p-«, psc. 
The push-down of the divergence are computed as follows: 


k 
tray ID _ (тео) н, (^s ELS б | 
{=1 D-?9xp^9 = оо у Doo 


D- 
= (r ЕЁ? С (r йл) у, 
0 хах 
= [A ТО)Е Ç (^n Es ) ба p одах. (93) 


k 
(ar. ЕЁ? [n cul 
1=1 pF? (х)ах 


k 
k pF? 0,8 
= т (KE [n ET J) 
1=1 pF? (х)ах 


On the other hand, 


C [|6 d 
1=1 D- хро 
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k 
- c» f T (x)0* |z i ЕЙ F? = J a dx. 


i=! 
(94) 


Here, ay means (9*)* = 0^ .-- O% (k times), and д* denotes the divergence oper- 


ator on the space (R", p^ m А 


Corollary 1 The asymptotic expansion of the density function of F*, p (у) is 
expressed with the push-down of the Malliavin weights as the follows: 


m (j) k 
є 0 : | 0 
pF) = ре) + dE р Haw (= П a^) zia | py) + 0(е"+1), 
k 


j=l 1=1 


(95) 


where p* (y) is the density function of Е. An alternative expression is given as 
follows: 


m G) 4 
pP) = po У е Yon [: i ЕДЕ? = J p o] + o(ent!), 
j=1 k 


1=1 


(96) 


(Proof) Take a delta function б, є S'(R") in the theorem above. 


6.2 Instantaneous Forward Rates Model 


As a typical stochastic model for pricing the interest rate derivatives, there exists a 
model developed by Heath-Jarrow-Morton [37], the so called HJM model, which is 
formulated based on the forward rates with infinitesimal terms of the interest rates, 
that is the instantaneous forward rates (£/(s, 1$) : 0 € s € t < Т}. Here, s is the time 
when the forward rate is fixed and ¢ denotes the inception time when the forward 
rate is applied. 

The stochastic processes for the instantaneous forward rates are considered in 
the framework of the asymptotic expansion by introducing a parameter є є [0, 1]. 
For example, let W be a m-dimensional standard Wiener process and let f(0, t), 
t € [0, T] be a given Lipschitz continuous function of t. Then, under the equivalent 
martingale measure, the stochastic processes of ( f(s, 7) : 0 < s < t < T} are 
solutions to the following stochastic integral equations: 
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sm 


t 
/® в) = f0.0 4 8 ] 5 EDD / G9, у), v уду de 
0 i=1 v 


m 5 
+È | обоо, опата) ie e [0 I (97) 
i=1 79 
where the volatility functions (o; (х, s, t); i = 1,..., т} are smooth and satisfy the 


regularity conditions which guarantee that the equation (97) has its unique strong 
solution. Itis to be noted that the drift term (the coefficient of the d v term) of f (9 (s, t) 
depends on { f (v, у); 0 < v < s,v < y < t]. Moreover, the stochastic process 
of the instantaneous short-term interest rate r9 (г) is determined by the relation, 
r9 (t) = FOC, r). 

For this model, the approximations of the values for interest rate derivatives can 
still be considered in a unified framework with derivation of asymptotic expansions 
of the instantaneous forward rates when є |, 0 and with use of the relation between 
the instantaneous forward rates and a zero-coupon bond price: 


T 
PO(rT)- exp {- / FE, udu : (98) 
t 


As an example, we consider pricing an option on a coupon bond (or a swaption), 
which is a standard interest rate derivative. The payoff at maturity of a call option is 
given by 


V.(T) = max [Serre Т) — К, | i 


1=1 


where 0 < T < Т «€ --- < Т, cj (i = 1,..., п) are positive constants and К (> 0) 
is а strike price. Then, its present value is given by 


V.) = E [c7 ^ "уту. (99) 
When є | 0, the forward rate f (€ (s, Р) is expanded around / (0, t) as 
FOCN~ РО, 0) + efi D E fo D inD?, — (100) 
where the coefficients of є", n = 1,2,..., that is fi1(t, и), fo(t, u)--- are also in 
D”. 


As a result, we obtain an expansion of the zero-coupon bond price P9 (r, T) 
around the current forward bond price POT) and an expansion of the discount 


factor exp [- Н г (а | around the current zero-coupon bond price Р(0, Т) as 
follows: 
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P(0,T 
PO(t, T) ~ ш 


F F 
POH -ef ладан | f(t, u)du 
‚1 [7 
єє n леда) Je іп D”, 


T T 
- fo r9GMs „ру, T) [ = 1! fit, dt — ef falt, tdt 
0 0 


SEN i — 
«elf леда +- іп D”, 


where f;(s, t), i = 1, 2 are given by 


(101) 


(102) 


fils, t) = 25569, , [xe DAW; Qv), 


1 9? f'9 (зи 
fas, t) = 4 5824... 


- f b (v, thdv + n > до}? (v, 1) fi (v, ато). 
0 0 i= 


Here cO (y t) = с; (f 9 (v, t), v, t), and b® (v, t) and do (y t) are defined as 


n t 
и = Ye Pts Dn | ас.» уду, 


= x, v, t 
Jo (v, t) = LT f(0,0- 


Therefore, in a similar way as in the framework for diffusion cases in the previous 
sections, we define х1! апа xO (i = 2 


t 
x = a|- / ПЛ 
0 


.,n)as 


(103) 
Ti 
xO" = PO, Т) = exp |-/ fO, ode] ,i=2,...,n. (104) 
t 
Then, the payoff at maturity of the call option on a coupon bond is written as 
n | 
V.(T) = тах [5 ox! =k, | (105) 
i=2 


Moreover, let x = (x1, x2, ..., Xn) and define g(x) as 
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g(x)2x (x CjXj — к) Р (106) 
ї=2 


In this way, we are able to employ a similar technique to pricing derivatives as in 
the case of diffusion processes. For example, with redefinition of variables such as 
Ут, the approximation of the option price V.(0) in (99) can be obtained based on 
the almost same asymptotic expansion method as in the previous sections. In fact, by 
using the above expansions of instantaneous forward rates, zero-coupon bond prices 
and the discount factor, we can apply the expansion to E[max{g(X С ) ), O}], where 
Xj eg oe С 

For the details and numerical examples, please see [49, 50, 88]. In particular, 
[88] implements numerical experiments under a smooth and bounded modifica- 
tion of two factor CEV-type volatility functions (as explained in Sect. 4.3.1), and 
the variance reduction technique in proposed in [107] to demonstrate the effective- 
ness of the method. We remark that the boundedness of the volatility functions 
{oj(x, s, t); i = 1,..., т} for the instantaneous forward rates fe (s, t) is one of the 
sufficient conditions that guarantee the existence of the unique strong solution of the 
stochastic integral Eq. (97). 

For evaluation of other various interest rate derivatives, approximations based 
on the asymptotic expansion approach can be derived in the similar manner. More- 
over, an example of an approximate formula for derivative prices dependent on the 
instantaneous forward rates in the HJM model and other variables following general 
diffusion processes is given by [85]. 


7 Asymptotic Expansion in Jump and Jump-Diffusion 
Models 


So far, we have used stochastic models whose randomnesses are generated by only 
Wiener processes. However, we are also able to apply the asymptotic expansion 
approach to stochastic processes including jumps in their sample paths. This section 
provides its very brief review. For the details, please see the cited papers. 

In terms of the mathematical viewpoint, Yoshida [120] presented an extension 
of Watanabe theory to develop a framework for providing a validity of asymptotic 
expansions in Wiener-Poisson spaces, which can be applied to jump-diffusion models 
under some regularity conditions. Hayashi [34] applied a Malliaivin calculus of jump- 
type to prove an asymptotic expansion theorem for functionals of a Poisson random 
measure, and Hayashi [35] derived the coefficients in the expansion of a call option 
price under a pure jump model. Moreover, Hayashi and Ishikawa [36] proved an 
asymptotic expansion formula for the compositions of a smooth Wiener-Poisson 
functional with Schwartz distributions. 
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In direct applications to finance problems, [51, 87] derived asymptotic expansion 
to approximate bond prices or/and plain-vanilla option prices under jump-diffusion 
with local volatility models. 

Subsequently, [93, 94] found a new expansion scheme for pricing long-term Euro- 
pean currency options under a Libor market model (LMM) and a general diffusion 
stochastic volatility model with jumps of spot exchange rates. Particularly, thanks to 
a linear structure of the underlying asset price process in their model, they separated 
the jump component with a known characteristic function to apply the expansion 
technique developed in the diffusion models. Also, [100] took a Malliavin calculus 
approach to derive asymptotic expansions of vanilla option prices in a jump-diffusion 
with stochastic volatility model. 

Recently, [80] has generalized the preceding researches such as [51, 87] and 
[100] in the asymptotic expansion approach, and developed a new approximation 
formula for pricing basket options in a local-stochastic volatility model with jumps. 
In particular, the model admits local volatility functions and jump components in not 
only the underlying asset price processes, but also the volatility processes. Moreover, 
they implemented some numerical experiments to confirm the validity of the method. 
Please see the paper for the details. 

As an example of asymptotic expansions of option prices under jump-diffusion 
models, the next subsection describes the outline of the method by using a simplified 
version of [80]. 


7.1 Pricing Basket Options Under Local Stochastic Volatility 
with Jumps 


In the first place, we define the model of the underlying asset prices and its volatility 
processes, which is used for pricing the European type basket options. In particular, 
suppose that the filtered probability space (О, F, Р, {F+}:>0) is given, where P is an 
equivalent martingale measure and the filtration satisfies the usual conditions. The 
risk-free interest rate is assumed to be a nonnegative constant r for simplicity. Then, 
(SD, его, т] апа (a!);ef0,r], i = 1,...,d represent the underlying asset prices and 
their volatilities for £ € [0, Т], respectively. Particularly, let us assume that LA and 
gi, are given by the solutions of the following stochastic integral equations: 


T T Р 
Sp =s «f о S; dt «f Psi (si. si) ам 
0 0 


п Мт | T . 
+> Улу 5 — | ASi Ел , jldt |, (107) 
l=1 \j=1 


T Т | 
ol = oi + f A — о! + | bai (ei Jaw” 
т "0 = oi (Or 1 
0 0 
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T 
+ [Уло -f Ас! _E[hgi ; jldt |. (108) 
1=1 \j=1 0 

where a, and 85 i = l,...,daregiven as some constants. The notations are defined 
as follows: 
e o! (i = 1,..., а) are constants. 

A! and 0' (i = 1,...,d) are nonnegative constants. 
e 05: (x, y) and ф(х) are some functions with appropriate regularity conditions. 
e WS апа W”, (i = 1, , d) are correlated Brownian motions. 
e Each Nj, (| = 1, nj is a Poisson process with constant intensity Ay. №, 

[= 1,...,п аге ети and also independent of all wS and W”. 
e Tj] stands for the jth jump time of №. 
e For each |. = 1,...,n and i = 1,...,d, both (es һул) Qmd 


(25 hi | i). 20 are compound Poisson processes. ‘ome jal = 0 when №; = 0.) 
E кзы 


For each / and x', Ле, (j € М) are independent and identically distributed 
random variables, where x! stands for one of SÌ and с! (i = 1,...,d). 


— for the log-normal jump case, Лу = = ені — І, where 


Y, jisa leu variable which follows a normal distribution with mean m ү | 


and variance y: x т that is, N (myi р, ус: у) for all j. 


t 
h 


hy ; ; and hy Ж (j # Ј') are independent. 

Ni and hyi y. j ‚аге independent. 

For the same / and j, лс у : and ho, { i (i,i’ = 1,...,d) are allowed to be 
dependent, that is Ys; ; ; and Y; j ® i‘ = 1,..., d) are generally correlated. 


xil, j and hu Ж (I £1’) are independent. 


Remark 8 By — ^^ о ф and фе, we can express various types of 
local-stochastic volatility models. For example, the model with Ф (с, 5) = (a S? + 
bS + c) /a and ф (c) = y/o corresponds to an extension of the Quadratic Heston 
model. The model with фс (о, 5) = S P5 с and фо (c) = c corresponds to an extended 
SABR (A-SABR) model, and the one with ¢s(o, S) = S/so and ф„(о) = сб 
corresponds to a local volatility on volatility with jumps model. 


Next, we introduce perturbations to the model (107) and (108). That is, for a 
known parameter є є [0, 1] we consider the following stochastic integral equations: 
fori =1,...,d, 


А Я T ou i 
pu = i+ f о? SO qr 4 Ji феі (с ‚© oS m) dw; 


(6) gh © i,(€) (є) 
+2 Daa se-f AS ЕАО, dr]. — (109) 


1=1 
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qi -%+/[ NB пае f" d (oi) awe 
0 
n [Nur 


Т 
(е) i,(€) i,(€) (©) 
+ > > hoi 29 - | Ліс; ЕЛ 114 " (110) 
1=1 \j=l 


where h9 15 = e л.) — 1, that is, we assume that the jump size follows a log-normal 


distribution, «У,у р ~ N(em.i |, e D p 


) 


We assume the asymptotic expansions of 57. 9 ang gu ‚© around e = 0 as follows: 


i,(€-) _ «і, (0) i,(1) e i,(2) 

Sp = Spo +eSp + 9157 Tee, (111) 
i.c) ai: (0) i,(1) a (2) 

op = + eo. ur T, (112) 

2 
(є) (0) a) € 2) 
hii Lj =h, Lj + ШИЙ + PIDEN ariy (113) 
"AGI 
iO ._ OS; iO 0) до ОИТ 
where S, =e 0 op = —Bb— e Bij = бе j = 


We also suppose that ws’, m ws. we” „ты, wey = 0. Z where p is a 
2d x 2d correlation matrix, and Z is a 2d-dimensional (independent) Brownian 
motion. 

For ease of the expressions we introduce the following notations: 


e Фу у :— Ф (о', 5) (0): у and Фор = Фи (o')(0)a+i,j, where (о); ; denotes 
the (i, j)-element of o. 

e Oy := (Og |,..., Фу 24) and 6, :— (Фә |... 
vectors. 

e Фу := (Ф<!,..., Psa)’ and 65 := (Ф1,..., P a) are d x 2d matricies. 

e We define a operator e “x” as follows: When A and В are d x 2d matrices, 


., $i 54) are 2d-dimensional 


(A) (B>, c CAJ124(B)124 
Аж В := : N : А (114) 
(А)а,1(В)ал ++: (A)a, 2a (B)a,2a 


When A is ad x 2d matrix and B is a d-dimensional vector, 


(A10) = (OD1234()1 
AxB=BxA:= : e : : (115) 
(А)а1(В)а ++ CA)a2a(B)a 


When A and B are d-dimensional vectors, 
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(Aji (B)1 
AB: | (116) 
(А)а(В)а 
e We also define ôy ^s (x = S ог о) as 
d (Фе) ses der (Ds)124 
0,0, = : n (117) 
zr (O5)a,i x 921 (®s)a,24 


where (Ф); j denotes the (i, j)-element of the d x 2d matrix Фу. 
e Let us introduce the following notations: 
Si = (5/,..., SO, о = (0 ео), 


G) 0 GO © G) aÒ © 
Ты, gph eg = Ay pe AE 0), 


, 1 4 1 а 
e?! = (е®1,...,е® t) апаем = (eùt, ... e). 


Based on these preparations, we obtain the next proposition. 


Proposition 2 The coefficients, gu с? апа hu =,S,o), i = 0,1,2 in the 


expansions (111), (112) and (113) are given as follows: 


SO = eo? x so, (118) 
o =04 (co — 0) «e, (119) 
© 
Aj = 0, (120) 
T 
p = е0 y os (of, S9) az, 
0 
n М.т 
+ У [40 -ATE DW « SQ, (121) 
l=1 X j=! 
T n Ni. T 
cQ = f pt eum. (0) 4Z, + | DAD, жеоо + o 
0 1=1 \j=1 
1 т 0 
-AE |n |же Я! ем s sar) | (122) 
Е 0 
1 
Se edge sl (123) 


T 
50 = n eT 4 Ass (20, 50) + 5042, 
0 


Т 
+ | ears (s. 59) x ofaz, 
0 
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n Мт 
+5 ( ХЕ - ATE [A] |50 
1=1 j= 
Мт 
+} һу), x eT- „та 
j=l 
T 
- AE [AG Jeers f et 50а), (124) 
1, А 
is xj * х,у. (125) 


Next, let us define the payoff of a basket call option with strike price K as 
(g(x) — К)? (= max{g(x) — K, 0}), (126) 


d 
g(x) = w:x = У wix', 


i=l 


where g(x) represents a weighted sum of the underlying asset prices of x1,..., ха 
with the constant weights w|,..., wg. Here, we set x :— (xl, ..., x4) and w :— 
(w1,..., ша). 


For an approximation of a basket option price, we firstly note that g (st?) is 
expanded around e = 0 as: 


(59) = = (si?) +g (se) +< 4 ze (st?) + o(Q). (127) 


Then, for a strike price K — g (S) — ey for an arbitrary y є R, the payoff of the 
call option with maturity T is expanded as follows: 


(ss?) к)" = (soe ә) 
= є (9 (5°) + 59 (si?) +y+ ZN 


2 
= (обу + j т TD (sp?) +00). (128) 


We next note that when the number of jumps is kj (1 = 1,..., п), that is on 
{№ = kj} := {Nir =ky,...; Nar = kn }, sP i in the equation (121) becomes 


£u) + Sr, (129) 
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where 
n 
£i = J (ki — ArT)ms, ж SP (130) 
1=1 


апа 


T n ki 
ST := n e2 T-t) y, Ds (er, sp?) dZ, + > » YS * Gs. j.l * v Я (131) 


l=1 \j=1 
Here, we use the following notations: 


ө 75, = st - -> 754) 
e (5,31 = (Сз! ыз» Сва j.1) is a vector of random variables, where Csi jul follows 
N (0, 1), that is the standard normal distribution. 


We remark that the distribution of g(Sr) is N (0. = n ) that is the normal dis- 


tribution with mean zero and variance x whose density function is expressed 


as 


(132) 


1 m 
n (s: 0, x) = ———— СЕ . 
fara} 257 
Неге, DE 0 is defined as follows: 


T T 
xx = | (w x eT) y, Ф; (er s)) (w x eT) y, Ф; (o, se dt 


0 
n 
0 0 
+} Qo ys ж $5) Des (шж узу Sp), (133) 
i=) 
where 0с; , stands for the correlation matrix of Gs, j = (Csi, ju. +--> бда, jj), and xt 


denotes the transpose of x. 
Next, we define 


то. Kp = E [o (SP) | = x, 1M: = kn]. (134) 


With those preparations, we approximate the expectation of the basket call payoff 
under an equivalent martingale measure in the following way: 
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«[e)- 9) 
= (ов e) lar) = х, {№ =) 


2 ^ 
+E |: | fash}? (s?) e =, tN = | +00). (135) 


We also note that the probability of {Л = kj} := {Nir = ki, ..., Мт = kn} 
is expressed as 


n k AIT 
(А 2 ге 
pi = [| ———_. (136) 


1=1 


which is the product of the k; times of ће jump probabilities of №. т (| = 1,..., п), 
that is П; P((Ni,r = kı}), thanks to the independence of № т (| = 1,..., п). 


Then, we calculate the coefficients of є and E on the right hand of (135) as follows: 
The coefficient of e is given by: 


E |: [c (si?) + DN lar) = x, UN = zl 


=> » р wf. (x + об) + у) n6; 0, xU ax, (137) 


=0 у" kı=k (gE (ky) +) 


and the coefficient of = is given by: 


E |: | lP} (SP) PED = x, {м = zl 


=>, 2. vw f m(x, (n6: 0, 50 )ах. (138) 


k=0 Y? i =k —(g (Eik) +y) 


Then, the initial value, C(K, T) of the basket call option with maturity T and 
strike К is expanded around є = 0 as follows: 


С(К,Т) = 


99 оо 
2. 2. pue fe f (x + ушу) п(х; 0, xax 
— Yik} 


k=0 Y? =k 
is k 

+e / m, {kr})n(x; 0, Бах | + o(), 
— Yik} 


(139) 
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where уру :— g(£(g) + y, and r is a constant risk-free rate. 

In order to evaluate rj (x, {k;}), that is the conditional expectation defined in (134), 
we apply some formulas derived in Lemma 3.2 of [80]. 

Consequently, with є = 1 we obtain an approximate pricing formula for a basket 
call option, which corresponds to an asymptotic expansion of the basket option price 
up to the c?-order. 


Theorem 5 Az approximation formula for the initial value C(K, T) of an basket 
call option with maturity T and strike price K is given by the following equation: 


{ki} 
оо Hi (х iM ) 
= Yk {k Т 
> >, pme 7 br ( PH) (2 "401 = Аар 
yl x» 
= T 


k=0 Ya hk 


ey 
o Bom) Ы (ув; 0, zi. (140) 


k 
where р} = [Tra (QT te а , r is а constant risk-free rate, у = 96509) —K 
ун} = gu i y, N m denotes the standard normal distribution function and 


n(x; 0, >) = VA exp GE г). Неге, x is given by (133), and £j is defined by 
(130). C1, C» and C3 are some constants, which are given with the derivations in 


Appendix B of [80]. Moreover, Hx (x: У p denotes the kth order Hermite polyno- 


mial: particularly, Hi (x: Ep) — x and H» (s: zm =x – yo, 


8 Perturbation Scheme in Forward Backward Stochastic 
Differential Equations (FBSDEs) 


The FBSDEs have become quite popular in finance community since El Karoui, 
Peng and Quenez [16], especially after the recent financial crises and the subsequent 
quite volatile markets, which leads us to recognize the importance of counter party 
risk management, particularly the credit value adjustments (CVA). 

However, an explicit solution for a FBSDE has been known only for a simple 
linear or quadratic example. Although several techniques have been proposed in 
the last decade, they seem very limited in practical applications since they rely on 
numerical methods for non-linear partial differential equations (PDEs) or regression 
based Monte Carlo simulations, which are generally very difficult to implement or 
quite time-consuming especially for high-dimensional and long-horizon problems. 

Recently, [25] has developed a simple analytical approximation scheme for the 
nonlinear FBSDEs, notably for not only the so called decoupled cases but also the 
coupled cases. Fujii and Takahashi [25] has introduced a perturbation parameter 
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to the generator of a backward stochastic differential equation (BSDE) to expand 
recursively the non-linear terms around a relevant linear FBSDE. In the computa- 
tion of each order, [25] explicitly represents the backward elements as the functions 
of the forward components and take those expectations. Hence, except the cases 
that the distributions of the forward process are explicitly known, we need to apply 
some approximations of the distributions, and so, again, the asymptotic expansion 
technique for the forward stochastic differential equation (FSDE) is useful in the 
approximations. Section8.1 below illustrates the scheme briefly. Fujii and Taka- 
hashi [25] also provided two numerical examples, where the second-order analytic 
approximations work quite well compared to numerical techniques such as the finite 
difference method and the regression-based Monte Carlo simulation. Please see the 
paper for the detail. 

Moreover, their subsequent work [26] has applied this scheme to the optimal port- 
folio problem in an incomplete market with stochastic volatility, and demonstrated 
the accurate approximations even for long maturities such as 10 years, as opposed 
to the regression based Monte Carlo simulation which works well only up to short 
maturities such as one year. 

We also note that the method has a great advantage of deriving explicit expressions 
of the optimal portfolios and hedging strategies, that is very important in practice. 
Furthermore, we can employ the method for the general multi-dimensional cases. 

In order to achieve further reduction of computational burdens in this method, the 
scheme with an interacting particle method has been recently developed. Section 8.2 
describes the outline. Please also see [29] as an application of the method to American 
option pricing. 

Furthermore, [104] provides a mathematical foundation for the original scheme 
in the decoupled case proposed in [25]. (The justification for the coupled case seems 
an important and interesting research topic.) It mainly consisted of two parts. That 
is, for the BSDE expansion with a perturbed generator they have obtained the coef- 
ficients up to an arbitrary order as the solution to a system of the associated BSDEs 
with the base FSDE, and present the error estimate of the expansion. Accordingly, 
they showed a concrete representation for each expansion coefficient of the volatility 
component, that is the martingale integrand in the BSDE. For the FSDE expansion, 
they derived an expansion formula with its sharp error estimate for the expectation 
of the solution to the base FSDE in terms of a small diffusion. Then, they combine 
the both results, particularly applying the FSDE expansion formula to the BSDE 
expansion coefficients to obtain a main result, that is an asymptotic expansion of 
FBSDEs with a perturbed generator. In the proofs, [104] effectively applied the rep- 
resentation results in Ma and Zhang [63] for the BSDE expansion and the properties 
of the Kusuoka-Stroock functions in Kusuoka [52] for the FSDE expansion. 

In a different stream, [102] has proposed a new semi closed-form approximation 
for the solutions of FBSDEs. In particular, applying the asymptotic expansion method 
in [100] and [103] to the forward SDEs with a Picard-type iteration scheme for the 
BSDEs, they have obtained an error estimate for the approximation. Moreover, they 
demonstrated the effectiveness of the method through numerical examples for pricing 
options with counter party risk under the local and stochastic volatility models, 
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where the credit value adjustment (CVA) is taken into account. Roughly speaking, 
considering a perturbed forward SDE X^, = є (0, 1] and an associated backward 
SDE (Y*, Z^), they have the following recursive asymptotic expansion around some 
non-degenerate gaussian model Хо. That 15, fork 2-0, N > 1 


yos м prone x) = Elg(X}"")] 


T 
+E |/ f. xu. y ST ха 
1 


N 

+ Dee Elo Xp ig] 
i=l 
N 


T 
+} £E |/ f(s, xb yam с. А (141) 


i=l 


get ~ (уи), х) = | ERP ONO 


T 
у0,1,1 E k,N,t,x ek, N. tx n79, 
+E |/ JU Xo pomo, Me пома) 
ГА 


N 

+ EIRT ON r] 
i=l 
N 


T 
+} dE |/ gis Ab yoke zi pas leca, x), 
t 


i=l 
(142) 


where YEENEY — yekN (g, KOO) and ze Ni — (y ио) (5, X01). Here, 
the processes то: апа М, і = 1,..., N are the Malliavin weights and in particular, 


NQ , corresponds to the weight appeared in a representation theorem in Ma and Zhang 
[63]. 


6.1 Expansion with Perturbed Generator in BSDE 


This subsection briefly describes the perturbation method following [25]. Firstly, let 
us consider the following decoupled FBSDE: 


dV, = —f (Xt, Vi, Z))dt + Z; - dW, (143) 
Vr = Ф(Хт), 


where V takes the value in R, W is a r-dimensional Wiener process, and X; valued 
in R is assumed to follow a diffusion process, which is the solution to the (forward) 
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SDE: 
d X, = yo(X1)dt + y(X1)- dw; Xo =x. (144) 


Hereafter, we assume the appropriate regularity conditions that guarantee the math- 
ematical validity. For example, pleases see [104] on this point. 

In order to approximate the pair of (V;, Z+) in terms of Ху, we extract the linear 
term from the generator f and treat the residual non-linear term as a perturbation 
to the linear FBSDE. That is, let us introduce a perturbation parameter e, and then 
write the equation as 


dV = cx) Var — eg(X,, V , Z)at + Z® - aw, (145) 
V? = (Xr). 
Here, the above equation with e = 1 corresponds to the original model: 
f OG, Vi, Zi) = —с(Х,)\У, + G(X, Vi, 2). (146) 
We remark that as in the previous asymptotic expansion cases, the residual part g 
should be small for a precise approximation. Hence, one should choose the linear 
term c(X Qv in such a way that the residual non-linear term g becomes as small 
as possible. 


Now, we are going to expand the solution of BSDE (145) with respect to e. That 
is, suppose vo and 709 are expanded as follows: 


v E y a cv + e vo ibis (147) 
ZO ОИ +... (148) 


For illustrative purpose, let us show a first few steps of the expansion. For the zeroth 
order of e, it is easily seen that vo is a solution to the following equation: 


dv = оху а + zO . aw, (149) 
VO = (Xr). (150) 
Then, vo can be represented as follows: 
y= E |e- Л хрл], (151) 
which is equivalent to the value of a standard European contingent claim with the 


terminal payoff Ф (Хт) and the discount rate c( X;) under a suitable pricing measure. 
Clearly, v is a function of X; due to the Markovian nature of the model. Moreover, 
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applying Itó's formula (or the Malliavin derivative), we are able to obtain z" asa 
function of X, as well. 
Next, let us consider the process voy: 
ду® = v9) = ex (v? — vat 
= 69%, Ve". 7(©)ат + (zP? — Z®) aw, 

v? — y =0. (152) 
Now, by extracting the e-first order term, we can once again recover the linear 
FBSDE: 

ау = (XV dt — gan, V, Z)at + Z” - aw, 
v® =0, (153) 


which leads to 


T и 
y(? — E |/ ar eds A(X, yo. Z du 


t 


| 2 (154) 


Because vo and gi» are some functions of X,,, we obtain vi? as a function of X;, 
and also 20) through Itó's formula (or Malliavin derivative). 

Inexactly the same way, we are able to derive an arbitrarily higher order correction. 
Particularly, due to the є in front of the non-linear term g, the system remains to be 
linear in every order of the approximation. For example, v that is the €?-order's 
coefficient of the expansion of y is the solution to the following equation: 


д 
av = (XV Pdi — К? y 9. ZO yD 


4 V,g(X,, VO, z 9» . ze dt + ZO .aw, (155) 
vi? = 0. 


In general, suppose that we have succeeded to represent backward components 
(Vi, Zi) in terms of X, up to the (i — 1)th order. Then, in order to proceed to a higher 
order approximation, we need to obtain the following form of expressions with some 
deterministic function С (-) in terms of the forward components X;. 


; T u 
vo = Е |/ 2; eds б(х„)аи 


1 


я) . (156) 


Even if it seems impossible to get the exact result, we can still have an analytic 
approximation for (v, Z и ). through again, the asymptotic expansion method. 
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As an example, [26] has explicitly derived an approximation formula for the 
dynamic optimal portfolio in an incomplete market setting, and confirmed its accu- 
racy comparing with the exact result by the Cole-Hopf transformation (Zariphopoulou 
[121]. 

Finally, let us provide a brief remark on an approximation of coupled FBSDEs. 
Let us consider the following generic coupled non-linear FBSDE: 


dV, = —f (t, X, V, Z)dt + Z, - dW, (157) 
Vr = Ф(Хт) 
ах, = Y(t, Xi, Vi, 2)ат + ү@:,Х,,М,,2)-а\/,; Хо=х. 


We are able to treat this case in the similar way as in the decoupled case by introducing 
perturbations to the forward SDE in addition to the one in BSDE: 


dV? = c(t, X@)VOdt — eg (r. x9, v9, 2°) dt + ZO .aw, 
(ө _ (є) 

v? = Ф (x?) 

dX)? = (r (t x9) + en (4, x9, v9, 2) Jae 


+ (с (б xm ten (i х,у, 25) . dW, 


We also note that the similar method can be applied to the coupled case under a PDE 
(partial differential equation) formulation based on the so called four step scheme 
(e.g. Ma-Yong [62].) Please see [25] for the details. Developing a mathematical 
validity of the scheme for the coupled case will be one of the research topics in the 
future. 


8.2 Perturbation Scheme with Interacting Particle Method 


This subsection briefly introduces a new scheme proposed by Fujii and Takahashi 
[27]. Except the cases that we are able to obtain fully closed form expressions, the 
high orders’ expansions of perturbed FBSDEs generally contain multi-dimensional 
time integrations of expectation values due to a convoluted nature of the scheme, 
which makes standard Monte Carlo simulations too time consuming. To avoid nested 
simulations, one can applies a particle representation inspired by the ideas of branch- 
ing diffusion models (e.g. Fujita [23], Ikeda, Nagasawa and Watanabe [40—42], 
McKean [69], Nagasawa and Sirao [70]). Then, we are able to provide a straight- 
forward simulation scheme to solve nonlinear FBSDEs at each order of the approx- 
imation based on the perturbation. In particular, comparing to the direct application 
of the branching diffusion method, the method is expected to be less numerically 
intensive, because thanks to expansions of the perturbed generator, the interested 


Asymptotic Expansion Approach in Finance 403 


system is already decomposed into a set of linear problems. We illustrate the outline 
of the method by following [27]. 

Again, let us introduce a perturbation parameter e in the generator of a BSDE as 
follows: 


ай = —ef (Xs, V9, Zs? )ds + 209 - aW, (158) 
Vf) = Ф(Хт), 

where X; € К is assumed to follow a generic Markovian forward SDE: 
dX; = yo(X5)ds + y(X,)  dW,; X, = xi. (159) 


Next, let us fix the initial time as t. We denote the Malliavin derivative of X, (u > t) 
at time f as 


D,X, E В’, (160) 
Let us also note that in terms of the future time u, the SDE of @ defined by 
Qu, = ð X, is given in the following: 
AY), = ӘХ) (У, a du + дел, (ХО ау 
Qi; = 65, (161) 


where д denotes the partial differentiation with respect to the kth component of 
X, and б" stands for the Kronecker delta. Here, i апа j run through {1,..., d} and 
(1, ..., r) for a, and we adopt the Einstein notation which assumes the summation 
of all the paired indexes. Then, it is well-known that 


(D,Xi), = Frua), 


where a € (1,...,r] is the index of r-dimensional Wiener process. 
First, for the e-zeroth order, it is easy to see 


ve = уст] (162) 


79° = Ba; VC) Vir Y OG) 


Fil. (163) 


Then, it is clear that they can be evaluated by standard Monte Carlo simulations. 
However, for their use in higher order approximations, it is crucial to obtain analytical 
(closed form) approximate expressions for these two quantities, for example based 
on the asymptotic expansion technique as before. 

In the following, let us suppose that we have obtained the solutions up to a given 
order of the asymptotic expansion, and write each of them as a function of x;: 
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у® = vo (xi) (164) 
Zi egli. 
Next, for the e-first order's coefficient vin, we obtain an expression as 
(1) ? 
y® = / E| ros. AUR ZO |F au 
t 
T 
= / E| f (Xu, vO (X,), 209) |F; Jau. (165) 
t 


Then, we define the new process for (s > f) by introducing a deterministic positive 
process A, as follows: 


VO — eff ма үа), (166) 


Here, A, can be a positive constant for the simplest case. Then, for the fixed initial 
time f, its SDE is given by 


d V4? = А Vir ds — As fs (Xs, VO (Xs), zO (x,yas F el Aadu Z) аў, 
where 


^ 1 p 
fis Gc 9), 10) = sel MM F(x, vO), 29). 
S 
Since we have КАШ = vi, one can easily see the following relation holds: 


T u ^ 
v —E p eh ABS fi (Xu vO (x,), zO (x, )du 


[4 


zi (167) 


Similarly to the cases of the standard credit risk modeling (e.g. Bielecki-Rutkowski 
[6], it is the present value of default payment where the default intensity is А; 
with the default payoff at s(> t) as p" (Xs, v (x,), © (x,)). Thus, we obtain the 
following proposition. 


Proposition 3 The v(? in (165) can be equivalently expressed as 


V? 1 ы] Чылт (x 0X), 2X) [Fi]. ав) 


Here т is the interaction time where the interaction is drawn independently from the 
Poisson distribution with an arbitrary deterministic positive intensity process A;. f 
is defined as 
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^ 1 pg 
fis Gc, v (x), 20 (жуу = е v ро, 09 0а), 29 0). (69) 
S 


Now, let us consider the c-order's coefficient of Z(9, that is the component Z (D. ft 
can be expressed as 


zP = /  E[p.f (x. ое), 290) |а (170) 


Firstly, we observe that the SDE of the Malliavin derivative of V(U is given as 
follows: 


d(D,V(?) = —(D, Xi)vi(x, 00), 2) Fx, 00), 2) + (DZ) . d We; 


DV? = zi», (171) 

where 
Vi (x, v9, 2) = 8; + Gv (x)0, + 0,229 (х)д,а, (172) 
f(x, v, 2) = fx, v (x), «© (3), (173) 


Then, we define for (s > t), D, ү а$ 
D, VP = ef t(D, VM), (174) 
and its SDE can be written as 


d(D, VE?) = A (Di VE?) ds — ACD, XD) v; (x, 00, 209) fi. (x, 00, )ds 
+еЁ dup, 7) . qw, (175) 


Then, we again have 
DV? = 20. (176) 


Непсе, 


Т 
zi -z| / e TS, XLYVIOG, v9, 2) feu Xu, w, 2 du 


1 


F| . (177) 
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Thus, following the same argument as for the previous proposition, we have the next 
result: 


Proposition 4 240 in (170) is equivalently expressed as 
z/?* = Me aE [165 7) 05 GG Vi Xr, 0, 2) fir Xp, v, 2 | n], (178) 


where the definitions of random time т and the positive deterministic process А are 
the same as those in the previous proposition. 


Now, we are able to obtain a new Monte Carlo scheme. That is, we have a new 
particle interpretation of (УЧ), Z) as follows: 


ER 
Ш 


Ves E [ tn (00,90) |] (179) 


= Е [een svo»! Vi Xr, 00), 0) f, or, v9, 2] л], (180) 


N 
| 


which allows an efficient time integration with the following Monte Carlo scheme: 


e Run the diffusion processes of X and Y. 

e Carry out Poisson draw with probability As As at each time s and if “one” is drawn, 
set that time as т. 

e Then stores the relevant quantities at 7, or in the case of (т > Т) stores 0. 

e Repeat the above procedures and take their expectation. 


Finally, we remark that the higher order coefficients in the expansions are evalu- 
ated in the similar way. Please see [27] for the details. 


9 Conclusion 


The present note has reviewed an asymptotic expansion approach in finance, partic- 
ularly in terms of computational problems arising in practice of financial derivatives. 
in finance. However, due to the limitation ofthe space, we have not provided thorough 
explanations especially for recent progress such as improvement schemes in Sect. 5, 
expansion methods in jump and jump-diffusion models in Sect. 7 and perturbation 
schemes in forward backward stochastic differential equations (FBSDEs) in Sect. 8. 
Please see the cited papers for the details. 

Moreover, we have not introduced an application of the method to mean-variance 
hedging problems in partially observable markets, which is an interesting topic as an 
application of stochastic filtering problems in finance. Please see [29] for the detail. 
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some of the existing ones and also point out some possible connections to finance. 
We also hope, it will be useful for the reader to have, in one place, the most recent 
results concerning the small-time asymptotics questions related to rough differential 
equations driven by fractional Brownian motions. Our discussion will mainly be 
based on one hand on the papers [5—7] by the two present authors and on the other 
hand on the papers [27, 28] by Inahama. 

Random dynamical systems are a well established modeling tool for a variety of 
natural phenomena ranging from physics (fundamental and phenomenological) to 
chemistry and more recently to biology, economy, engineering sciences and mathe- 
matical finance. In many interesting models the lack of any regularity of the external 
inputs of the differential equation as functions of time is a technical difficulty that 
hampers their mathematical analysis. The theory of rough paths has been initially 
developed by T. Lyons [31] in the 1990s to provide a framework to analyze a large 
class of driven differential equations and the precise relations between the driving 
signal and the output (that is the state, as function of time, of the controlled system). 

Rough paths theory provides a perfect framework to study differential equations 
driven by Gaussian processes (see [19]). In particular, using rough paths theory, 
we may define solutions of stochastic differential equations driven by a fractional 
Brownian motion with a parameter H > 1/4 (see [15]). Let us then consider the 
equation 


t d t 
Xi =x «f Vo(X*)ds XM Vi (X3)d Bl, (1.1) 
0 i21 "9 
where x € R”, Vo, Vi, ..., Уд are bounded smooth vector fields and (В,); о is а 


d-dimensional fractional Brownian motion with Hurst parameter H € (1, 1). A 
first basic question is the existence of a smooth density with respect to the Lebesgue 
measure for the random variable X¥,t > 0. After multiple works, itis now understood 
that the answer to this question is essentially the same as the one for stochastic 
differential equations driven by Brownian motions: the random variable X? admits 
a smooth density with respect to the Lebesgue measure if Hórmander's condition is 
satisfied at x. More precisely, if J = (i1, ..., ix) € (0,..., а}, we denote by V; the 
Lie commutator defined by 


Vi = [Vi, [Viz з [Vras ү]... 


апа 
d(I) = К+ n(1), 


where n(J) is the number of Oin the word J. The basic and fundamental result 
concerning the existence of a density for stochastic differential equations driven by 
fractional Brownian motions is the following: 


Theorem 1.1 ([4, 10, 12, 24]) Assume Н > i and assume that, at some x € К", 
there exists N such that 
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span(Vr(x), d(I) x N} = R”. (1.2) 


Then, for any t > 0, the law of the random variable X* has a smooth density p, (x, y) 
with respect to the Lebesgue measure on R”. 


Once the existence and smoothness of the density is established, it is natural to 
study properties of this density. In particular, we are interested here in small-time 
asymptotics, that is the analysis of p;(x, y) when t — 0. Based on the results in 
the Brownian motion case [1, 2], and taking into account the scaling property of the 
fractional Brownian motion, the following expansion (in particular when n — d) is 
somehow expected when x, y are close enough to each other: 


_4(х,у) N А 
РУ) = qat "P (ао tryst, x pte), (1.3) 
1=0 


Our goal is to discuss here the various assumptions under which such expansion 
is known to be true and also discuss possible variations. The approach to study the 
problem is similar to the case of Brownian motion, the main difficulty to overcome 
is to study the Laplace method on the path space of the fractional Brownian motion 
(see [3] for the Brownian case). 

The paper is organized as follows. In Sect.2 we give some basic results of the 
theory ofrough paths and ofthe Malliavin calculus tools that will be needed. In Sect. 3, 
we prove a Varadhan's type small time asymptotics for In p; (x, y). The discussion 
is mainly based on [7]. In Sect. 4, we study sufficient conditions under which the 
above expansion (1.3) is valid. Our discussion is based on [5, 27, 28]. Finally, in 
Sect. 5, we discuss some models in mathematical finance where the asymptotics of 
the density for rough differential equations may play an important role. 


2 Preliminary Material 


For some fixed Н > І, we consider (©, F, Р) the canonical probability space 
associated with the fractional Brownian motion (in short fBm) with Hurst parameter 
H. That is, Q = Co([0, 1]) is the Banach space of continuous functions vanishing 
at zero equipped with the supremum norm, F is the Borel sigma-algebra and Р is 
the unique probability measure on © such that the canonical process В = (B, = 
(ВІ, MESS Be ), t € [0, 1]} is a fractional Brownian motion with Hurst parameter Н. 
In this context, let us recall that B is a d-dimensional centered Gaussian process, 
whose covariance structure is induced by 


som dj 
R(t,s):— В} Bi] => (29 e P8 - i 928), s,t € [0, Папа j =1,...,d. 
Q.1) 
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In particular it can be shown, by a standard application of Kolmogorov's criterion, 
that B admits a continuous version whose paths are y-Hélder continuous for any 
y < Н. 


2.1 Rough Paths Theory 


In this section, we recall some basic results in rough paths theory. More details can be 
found in the monographs [20] and [32]. For N є N, recall that the truncated algebra 
T (RÌ) is defined by 


N 
TYR) = ARA”, 


m=0 


with the convention (R“)®° = R. The set Т (IR^) is equipped with a straightforward 
vector space structure plus an multiplication &. Let mm be the projection on the 
mth tensor level. Then (T (R4), +, &) is an associative algebra with unit element 
1 є (RHS. 

Fors < t and т > 2, consider the simplex A7, = {(u1, ..., Um) Є [5, t]; u1 
< +++ < Um}, while the simplices over [0, 1] will be denoted by A". A continuous 
тарх: A? + T (IR?) is called a multiplicative functional if for s < и < t one 
has ху, = Xs,u хи. An important example arises from considering paths x with 
finite variation: for 0 < s < t we set 


xe >, (/ ах! de) ei DQ eip, (2.2) 
Аз 


1<i),...,im<d 
where {e1, ..., eg} denotes the canonical basis of 4, and then define the truncated 
signature of x as 
N 
SNE): A ә TYRD, — (5,0) SNE) = 1+ У x" 
m=1 


The function Sy (x) for a smooth function x will be our typical example of multi- 
plicative functional. Let us stress the fact that those elements take values in the strict 
subset G^ (R^) c T" (В), called free nilpotent group of step N, and is equipped 
with the classical Carnot-Caratheodory norm which we simply denote by |. |. For a 
path x є C([0, 1], GY (R), the p-variation norm of x is defined to be 


1/р 
Ixllp-var:t0, = sup {У іх 8 xul 
IIC[0.1] i 


where the supremum is taken over all subdivisions II of [0, 1]. 
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With these notions in hand, let us briefly define what we mean by geometric 
rough path (we refer to [20, 32] for a complete overview): for p > 1, an element 
x : [0, 1] > GP) (В) is said to be a geometric rough path if it is the p-var limit of 
a sequence 5| (x"). In particular, it is an element of the space 


CP va. (t0, 1], СЇРЇ (R4)) = (x e C(0, 1], GU? (RÀ) : IIx]l p-var:10,1] < оо}. 


Let x be a geometric p-rough path with its approximating sequence x", that 
is, x" is a sequence of smooth functions such that x" = 5, (x) converges to 
x in the p-var norm. Fix any 1 < q < p so that p^! + q^! > 1 and pick any 
h € С“ ([0, 1], R7). One can define the translation of x by h, denoted by 7, (х) 
by 

Ta (x) = lim Sip] (x" +h). 
n—oo 


It can be shown that 7), (x) is an element in СР“ ([0, 1], С.У (R7)). Moreover, one 
can show that Ту (x) uniformly continuous in h and x on bounded sets. 


Remark 2.1 A typical situation of the above translation of x by Л in the present paper 
is when x = B, the fractional Brownian motion lifted as a rough path, and h is a 
Cameron-Martin element of B. In this case, we simply denote 7; (B) = B +h. 


According to the considerations above, in order to prove that a lift of a d- 
dimensional fBm as a geometric rough path exists it is sufficient to build enough 
iterated integrals of B by a limiting procedure. Towards this aim, a lot of the infor- 
mation concerning B is encoded in the rectangular increments of the covariance 
function Ё (defined by (2.1)), which are given by 


Ru, = E |(B} — BD (B3 — B]. 
We then call 2-dimensional p-variation of R the quantity 


1/р 


; (si), G)ellp, 


ИР. 
VR) = sup | | 32 | 
hj 


where П stands again for the set of partitions of [0, 1]. Itis know that (see, for example 
[20]) if a process has a covariance function with finite p-variation for p € [1, 2), it 
admits a lift to a geometric p-rough path for all p > 2p. As a consequence, we have 
the following for fractional Brownian motions: 


Proposition 2.2 For a fractional Brownian motion with Hurst parameter Н, we 
have V,(R) < oo for all p = 1/(2H). Consequently, for H > 1/4 the process B 
admits a lift B as a geometric rough path of order p for any р > 1/H. 
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2.2 Malliavin Calculus 


We introduce the basic framework of Malliavin calculus in this subsection. The reader 
is invited to read the corresponding chapters in [33] for further details. Let € be the 
space of R¢-valued step functions on [0, 1], and H the closure of £ for the scalar 
product: 


d 
(Чо, 77 s Troa); toss] Шол) = У RG. si). 
ї=1 


We denote by К H the isometry between H and L?([0, 1. When H > 1 it can be 
shown that L!/P ([0, 1], R4) C H, and when 1 < Н < 1 one has 


C" CH c L^ ([0, 1) 


for all y > 5 H. 

We remark that H is the reproducing kernel Hilbert space for B. Let y be the 
Cameron-Martin space of B, one proves that the operator R :— Ry : H > Hy 
given by 


Rap i= ra Ky(-, [Kg v]G) ds (2.3) 


defines an isometry between H and #4. Let us now quote from [20, Chap. 15] a 
result relating the 2-d regularity of А and the regularity of £5. 


Proposition 2.3 Let B be a fBm with Hurst parameter 1 <H < 2. Then опе has 
Ha C CY for p > (Н + 1/2)-!. Furthermore, the following quantitative bound 
holds: 


I^ M o— var 
lla Z im 
TO SU 


Remark 2.4 The above proposition shows that for fBm we have Hy C С?“ for 
p > (H + 1/2)7!. Hence an integral of the form / ^ dB can be interpreted in the 
Young sense by means of p-variation techniques. 


Remark 2.5 Under the same conditions, the above embedding can be sharpened to 
Hy C С?“ for all p > (Н + 1/2)7!. We refer interested readers to [17] for more 
details. 


A F-measurable real valued random variable F is then said to be cylindrical if it 
can be written, for a given n > 1, as 


F = f (B@!),....B@") = iq па ав.) |, 
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where ¢' € Hand f : R” — R is a С° bounded function with bounded derivatives. 
The set of cylindrical random variables is denoted S. 

The Malliavin derivative is defined as follows: for F € S, the derivative of F is 
the IR valued stochastic process (D; F)o-;-, given by 


DF => 54 (Be, ..., con). 
i=l : 


More generally, we can introduce iterated derivatives. If F € S, we set 


For any р > 1, it can be checked that the operator Dé is closable from S into 
L? (Q; H®*), We denote by DE? the closure of the class of cylindrical random 
variables with respect to the norm 


k 
n] y j Р 
ІЕЕ р = | E (F?) + 25 5 (р^ |е) і 


апа 


Definition 2.6 Let F = (Fl,..., Е") be a random vector whose components аге 
in D”. Define the Malliavin matrix of F by 


qe = (DF! DF) yg isi jsn 
Then F is called non-degenerate if yp is invertible a.s. and 
(det yp) ! e ny iLP (Q). 


It is a classical result that the law of a non-degenerate random vector F — 
(F!,..., Е") admits a smooth density with respect to the Lebesgue measure on 
R”. Furthermore, the following integration by parts formula allows to get more 
quantitative estimates: 


Proposition 2.7 Let F = (F!,..., F") beanon-degenerate random vector whose 
components are in D™, and yr the Malliavin matrix of Е. Let С € D” and o bea 
function in the space Cp (R^). Then for any multi-index о € (1,2, ..., n, k > 1, 
there exists an element Ha = Н.(Е, С) € 0% depending оп F and С such that 


[Oa 9 (F)G] = Ely(F) Hal. 
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Moreover, the elements Ha are recursively given by 


Не) = Уч (coz!) 


j=l 
На = Hiap (H(ay,...,0%-1))> 


and for 1 < p < q < œ we have 


—1 [4 
н.а» < Clg DFI „Ск. 


here + = 14 1. 
where 7 ate 


Remark 2.8 By the estimates for Hy above, one can conclude that there exist con- 
stants 3, ^y > 1 and integers m, r such that 


=j А 
I Holz» < Cp,qil det yp WF sIIDF lle IG lle.g- 


2.3 Differential Equations Driven by Fractional Brownian 
Motions 


Let B be a d-dimensional fractional Brownian motion with Hurst parameter H > І. 
Fix a small parameter = є (0, 1], and consider the solution X7 to the stochastic 


differential equation 
а pt | t 
Xf =x ex V; (X)d Bi «f Vo(e, X5)ds, (2.4) 
i-i 70 0 


where the vector fields Vi, ..., Уд are C° -bounded vector fields on R” and Vo(e, -) 
is C??-bounded uniform in = є [0, 1]. 

Proposition 2.2 ensures the existence of a lift of B as a geometrical rough path. 
The general rough paths theory (see e.g. [20, 22]) together with some integrability 
results (see e.g. [12, 18]) allow us to state the following proposition: 


Proposition 2.9 Consider Eq. (2.4) driven by a d-dimensional fBm B with Hurst 
parameter Н > І, and assume that the vector fields Vis are C9?-bounded. Then 

(1) For each & € (0, 1], Eq. (2.4) admits a unique finite p-var continuous solution X* 
in the rough paths sense, for any р > qr 

(ii) There exists А > 0 such that 


z | exp à sup [XE PED. |) 2, (2.5) 
t€[0,1],e€(0,1] 
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Once Eq. (2.4) is solved, the vector X; is a typical example of random variable 
which can be differentiated in the Malliavin sense. We shall express this Malliavin 
derivative in terms of the Jacobian J^ of the equation, which is defined by the relation 


ELJ g,i 
y! 28, xt. 


Setting DV; for the Jacobian of V; seen as a function from R” to R”, let us recall 
that J^ is the unique solution to the linear equation 


i-o | 
Ж =1„+єЎ | руу(хеу Кав}, (2.6) 
А 0 
P 


and that the following results hold true (see [10, 11, 34] for further details): 


Proposition 2.10 Let X* be the solution to Eq. (2.4) and suppose the V;'s are С°- 
bounded. Then for every i = 1,...,n, t > 0, and x € №", we have X;" € 0° 
and | 

DIVA j=1,...,d, ОЕ 


where Di x?! is the jth component of D, X=", Ji = 0,X; and Ji, = LT 


Let us now quote the recent result [12], which gives a useful estimate for moments 
of the Jacobian of rough differential equations driven by Gaussian processes. 


Proposition 2.11 Consider a fractional Brownian motion B with Hurst parameter 
Н > 1 and р > E Then for any т = 1, there exists a finite constant ср such that 
the Jacobian J^ defined at Proposition 2.10 satisfies: 


| sup P ы = Gp. (2.7) 
e€[0, 1] 


Proof The integrability of J^ is only proved in [12] when є = 1. On the other hand, 
the estimates of J in [12] only depends on the supremum norm of the vector fields and 
their derivatives. In our case, the vector fields in Eq. (2.4) are eV! s whose derivatives 
together with themselves are bounded uniform in = є (0, 1). Hence the uniform 
integrability of J^ (in є) follows. 


Finally, we close the discussion of this section by the following large deviation 
principle that will be needed later. Let  : #4 — C([0, 1], IR") be given by solving 
the ordinary differential equation 


d t t 
тш) =х+ > | иван [wo odds. оз) 
i=l 
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Theorem 2.12 Let Ф be given in (2.8), which is a differentiable mapping from Ён 
to C([0, 1], R”). Introduce the following function on IR" 


1 
(у) = inf -lhg . 
(y) SUNT I^ M a, 
Recall that X} is the solution to Eq. (2.4). Then ХТ satisfies a large deviation principle 
with rate function I ( y). 


Proof Fixany p > i It is known (see [20]) that £B as a G!?! ((R4)-valued rough path 
satisfies a large deviation principle in p-variation topology with good rate function 


given by 


l 2 i 
dos z llall if h є IO 
+oo otherwise. 


It is clear ,(-) : СІРІ (В4) — R” is continuous. Now that X1 = (eB), the 
claimed result follows from the contraction principle. 


3 Varadhan Asymptotics 


In this section, we are interested in a family of stochastic differential equations driven 
by fractional Brownian motions В (with Hurst parameter Н > Т) of the following 
form 


а ы 

Xf = х ку | ү;(Х°)ав!. 
; 0 
i=] 


We define a map Ф : £j — С[0, 1] by solving the ordinary differential equation 


d ы 
Ф 2x - Y f V (b, (та. 
i=1 


Clearly, we have X? = Ф, (eB). Denote by уф л) the deterministic Malliavin matrix 
of (A), i.e., " | 
фу = (D$ 0), РФ! (А)) н. 


Introduce the following functions ор R", which depend on Ф 


1 


; 2 
inf [ДИ 
1 (h)=y,detyo,q)>0 2. # 


1 
2 : 2 2 
е Же, n6 dp) = 
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In the absence of the drift term (Vo — 0) in our setting in this section, one can show 
that the above two distances coincide. 


Lemma 3.1 For every y € К", we have d(y) = driv). 


Proof We follow an argument of Léandre (see [30]). By using Theorem I.2 in [30] and 
the isometry between the Cameron-Martin space of the fractional Brownian motion 
and the Cameron-Martin space of the Brownian motion, we see that for every є > 0, 
there exists Л є H such that ||h|| ;z < = and det y@,(n) > 0. Then arguing as in the 
Remark after Proposition II.1 in [30], we can for every 7 > 0 and y є R” construct 
h € H such that Pı (A4) = y, det ^o, л) > О and 


l uid 2 
zll S dO) Em. 


Throughout the section, we assume that the following assumption Hypothesis 3.2 
is satisfied. Let us first introduce some notations. Let A = {Ø} U (ЈР |01, 2, +, ny: 
and A; = A \ {Ø}. We say that J є A is a word of length k if = (i1, ---, ік) and 
we write |Z| = k. If Т = Ø, then we denote |/| = 0. For any integer / > 1, we denote 
by A(J the set {7 € A; |Z| < L} and by A, (J) the set (7 € Aj; |Z| < 1) . We also 
define an operation ж on A by J x J = (i1, ++ +, ik, Jr ss JD for = (iss ik) 
and J = (j1, +++, jj) in A. We define vector fields Viz] inductively by 


Мл = Уу, Vien = Min. Vil, /=1,.4 


Hypothesis 3.2 (Uniform hypoelliptic condition) The vector fields V;,---, Va are 
in Ср (IR") and they form a uniform hypoelliptic system in the sense that there exist 
an integer / and a constant А > 0 such that 


>, (Ил(х). и) > Ан]? (3.1) 
Te Aj (D 


holds for any x, u € IR" 


Under this assumption the main result proved in [7] is the following Varadhan's 
type estimate: 


Theorem 3.3 Let us denote by p-(y) the density of Хү. Then 


lim inf e log p-(y) = —d?(y). (3.2) 


The two key ingredients in proving Theorem 3.3 are an estimate for the Malliavin 
derivative DX] and an estimate of the Malliavin matrix ух; of X 1. Building on 
previous results from [8], the following estimates were obtained in [7]: 
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Lemma 3.4 Assume Hypothesis 3.2. For Н > 1, we have 


(1) Supze(o. у Xil, < 00 for each k > 1 andr > 1. 


(2) 1552 lle = cre! foranyr > 1. 


Proof of Theorem 3.3 We first show that 


lim inf c log p-(y) > —da?. (3.3) 
E 


Fix y € R”. We only need to show for d^ (y) < OO, since if d? (y) = oo 
the statement is trivial. Fix any 7 > 0 and let h є # be such that Ф (л) = 
у, det, (h) > 0, and IL < d2(y) + п. Let f € CS°(R"). By the Cameron- 
Martin theorem for fractional Brownian motions, we have 


ШЕ В(һ) 
Lf (Xj) =e 37 Ef(@i(eB+h)e=. 


Consider then a function x є C??(R), 0 < x x 1, such that y(t) = O ift £e 
[—2n, 21], and y(t) = 1 if є [—5, n]. Then, if f > 0, we have 


Will эе +40 


Sf (Xj) me 2 Exe B(h)) (ФЕВ + h)). 


Hence, we obtain 


1 
e? log p.) 2 — 7 + 2л) + 22106 Е(х(єВ(л))бу(Ф1(єВ + һ))). 
(3.4) 


On the other hand, we have 


:(x(EB(D)8, (bi (€B + һ))) = є-"Е (xem (Se E 2 к 2) | 


Note that лей жй 
Zi) = lim (eB +h) — Ф\(Л) 
€ 


Є 


is an-dimensional random vector in the first Wiener chaos with variance уф, (Л) > 0. 
Hence Z4 (A) is non-degenerate and we can then prove that we obtain 


$i(cB + А) — Ф\(Л 
lim (xeso (2:699 — 19) = gazio). 
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Therefore, 
lim e? log Е(х(є В(һ))бу(Ф1(єВ + h))) = 0. 


Letting = |, 0 in (3.4) we obtain 
ае 1 
lim inf є log pe(y) > — en i: 2n) > — (0) +3n). 


Since 7 > 0 is arbitrary, this completes the proof of (3.3). 
Next, we show that 


lim sup 22 log p-(y) < —d?(y). (3.5) 
Є}0 


Fix a point y є R” and consider a function x є Co^ (R^), 0 € x < 1 such that x 
is equal to one in a neighborhood of y. The density of ХТ at point y is given by 


pe) = EGCXD, (X1). 


By the integration by parts formula of Proposition 2.7, we can write 


2x9, 0X) = E (35i Haa... XF, XX) 
< E|Ha 2, (Xi, XXI 
= ?(IHa,2 un n)(Xq, ХОХ) 1Ц хе esuppy}) 


1 : 
< P(X] є supp)? |Н... (Xi X CC) IL p. 


where 5 + E = 1. By Remark 2.8 we know that 


335 


Finally by Theorem 2.12, a large deviation principle for Хт ensures that for small 
є we have 


Р(Хї є supp)? < 792 lint vesuppy POJ) 


This gives us (3.5). 
Combining Lemma 3.1, (3.3) and (3.5), the proof of Theorem 3.3 is thus com- 
pleted. 
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4 Small-Time Kernel Expansion 


4.1 Laplace Approximation 


Fix H > 1 and consider Eq. (2.4). For the convenience of our discussion, in what 
follows, we write the above equation in the following form 


t 


t 
Хё=х + Ji c(X;)d B, «f b(e, X;)ds, 
0 0 


where c is a smooth d x d matrix and b a smooth function from R* x В to R^. 
We also assume that c and b have bounded derivatives to any order. 

Fix p > i Let F and f be two bounded infinitely Fréchet differentiable func- 
tionals on C?—V"19. (t0. 1], R7) with bounded derivatives (as linear operators) to 
any order. We are interested in studying the asymptotic behavior of 


J(e) = Е f(X) exp{-F(X°)/e"}], — ase { 0. 


Recall for each k € #7, ®(k) is the deterministic Itó map defined in (2.8). Set 


A() = inf | |kl;z,, 9 = Pk), k € нү. 


| 
2 
Throughout our discussion we make the following assumptions: 


Assumption 4.1 e H 1: F + A attains its minimum at finite number of paths 


фы, фэ, ..., фа on P(R^). 
e Н 2: For each i € (1,2,...,n], we have ф; = (^j) and 7; is a non-degenerate 
minimum of the functional F o Ф + 1/2|| - [Ж le.: 


Vk € Жн\{0}, a? (F o o 1/2] lg) ik? > 0. 


The following theorem is the main result of this section. 


Theorem 4.2 Under the assumptions H 1 and Н 2 above, we have 


J(e) = С + ол +--+ aye" 3b oc»). 
Here 
a = inflF + A(9),ó € P(RÀ)] = inf(F o () + 1/2147, k E€ Hn} 


and 
c= inf {dF (oj) Yi, i € (1,2, әп}, 
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where Y; is the solution of 

dY;(s) = dxa (Qi (s))Y;(s)d^i (s) + O-bO, Фф; (s))ds + O,b(O, o; (s))Y;(s)ds 
with Y; (0) = 0. 


In what follows, we sketch the proof of the above Laplace approximation in the 


case Н > 1. Remarks on the rough case 1 <H< 1 will be provided afterwards. 


Without loss of generality, we may assume that F + A attains its minimum at a 
unique path ф. There exists a y € # such that 


1 
ф=Ф(), and AS) = 5171 
and 


ег. 1 
a Č inf{F + A(d), 6 € PRD) = inf Fo Bb) + lieu, kE ori. 


Moreover by assumption Н 2, for all non zero k € Ay: 


Ф(ЕоФ+ Ц IA Y? > 0 
2 Hy ^Y > Ч. 


Consider the following stochastic differential equation 


t 


t 
27 =x «f c (Zi)(edBs + dys) «f b(e, Z;)ds. 
0 0 


It is clear that Z? = ф. Denote 27° = 0" ZF and consider the Taylor expansion 
with respect to = near = = 0, we obtain 


N а 
- ied - 
Zz = o+ > a EFT RS a, 
j=0 ^ 


where gj = 779. Explicitly, we have 
dgi(s) = o(¢s)dBs + xa (ós)gi(s)d^ys + Ocb(O, os) g1(s)ds + Ocb(O, os )ds. 


Now the proof is divided into the following steps. 
Step 1: By the large deviation principle, the sample paths that contribute to the 
asymptotics of J(e) lie in the neighborhoods of the minimizers of F + A. More 
precisely, for р > 0, denote by B(¢, p) the open ball (under A-Hólder topology for 
a fixed А < Н) centered at ф with radius p. There exist d > a and єр > 0 such that 
for all = < єр 
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[JO - E [race POP, x* e Beo | s e". 


Hence, letting 


4/6) = E| roce Р x* e BE, р), 


to study the asymptotic behavior of J (=) as = | 0, it suffices to study that of J(e). 
Step 2: Let 0(=) = F(Z^) and write 


1 
(=) = 000) + є®Ө'(0) + 56 9 (0) + e (Б). 
By the Cameron-Martin theorem for fractional Brownian motions, we have 
Jp(&) (4.1) 


ki 
we F(Z*) її КОЕ 1512, | 
=E ls )exp (- E ES (44 (Ki) LK 35), Bs = a ; 7° є Bid, p) 


І І 6() + ff ((3)71 (55 9),4Bs 
e| exp 5 H 9] vo Jo ( H H V) d 


Е 


ехр [-30"0| [AZO] ; 7° є BO, pj. 


Step 3: It is clear that to prove Theorem 4.2, it suffices to analyze the four terms in 
the expectation above. First of all, it is apparent that the first term (of order-2) is 


1 1 -a 
exp E: (Fw + бл) =, (42) 


which gives the leading term the Varadhan asymptotics. 
The second term (of order-1) is deterministic. Indeed, since y is a critical point 
of Fo ^ + 1/2|| - бд, and note |||. ж, = ПК, ме ћауе 


T š 
dF (фу(4Ф(С ОЕ) = — n (KEKR) ds. 


By the continuity of Young's integral with respect to the driving path, the above 
extends to 


T: Я 
ағас) =- | (к c), ав. 


On the other hand, note 
0'(0) = 4Е(Ф)д\, 
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and 


gı = dó(y) B + Y. 


Here Y is the solution of 


dY; — oxo (ds )Ysdys F 0-b(0, ds )ds + Ox b(0, Ф») Уа, Ү(0) = 0. 


We obtain 


0OY + у (Kid Ka D), dB | _ т [- irr] 
Е 


(4.3) 
€ 


For the third term (of order 0), one can show that there exists а 3 > 0 such that 


1 
V exp [- + 8) Ex « oo. (4.4) 


Let us emphasize that in order to show the above integrability of 0" (0), one needs 
to use assumption H2 and prove that d?F o o ()(k!, k?) is Hilbert-Schmidt. For 
more details, we refer the reader to [5] for the case when Н > 1, and to [27] when 
1 <Н< І. Moreover, опе can prove the following integrability of А (=). 


Lemma 4.3 There exist а > 0 and єр > 0 such that 


sup E a relents). ZE € Bid, p) « oo. 


0<=<=0 
Lemma 4.3 and (4.4) allows us to analyze ће third and forth terms and show 


N 
[F (Z)? ORO: ZE є BY, ру] = У ame” + ONT). (4.5) 


m=0 


Finally, combining (4.1)-(4.3), and (4.5), the proof of Theorem 4.2 is complete. 


Remark 4.4 In application (see the next section), one may also be interested in an 
SDE which involves a fractional order term of e, 


t t 
X =х+ ef o(X®)dB, +e f b(e, X5)ds. (4.6) 
0 0 
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For this purpose, let us first introduce 


A= [m 2 т, = 0,1,2,...}, (4.7) 
Н 
the set of fractional orders. Let 0 = ко < ку < ко < --- be all elements of A, in 
increasing order. When Н > І, we have 
( ) 0,1 : 2,1 + : (4.8) 
ко, K1, K2, K3, K4, ...) = (0, 1, —, 2, — Wis ; 
0, K1, K2, K3, K4 H H 
Set 
Аз = {k — 2|& € A10}, 
and define 
Аз = {a a2 +- +am|m € N and a1, ..., am € Aq} 
and 
A = {а1 +а +---+am|m € № and aj,..., am € Ad}. 
Finally let 


Ag = {а + bla € As, b € АЗ} 


and denote by (0 = Ao < А < А < ...} all the elements of A4 in increasing 
order. Let us note that the set Лз characterizes the powers of = coming from the term 
f (Z*) in (4.1) and л; characterizes that of е5). 

Similar as before, we consider 


_ t , i t _ 
р =х+/ eZ) ed, +dy) е | b(e, 2°)аз. (4.9) 
0 0 


It can be proved that Z* has the following expansion in =, 


N 
7° = Ф+ У gue" ЕЧ We 
j=0 


Note that in (4.8), indices up to degree two are (0, 1, 1/ H, 2). There is an extra term 
1/H compared to the case without fractional order. Hence when plugging (4.9) into 
Step 2 of the proof of Theorem 4.2, there is an extra (but deterministic) term 


аЕ(Ф)дкә 
Ру Ул}, 
E" Н 
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where дк satisfies 


dg; (s)= Orbs) gi; (s)dys + b(0, ds)ds, Ino (0) = 0. 


It is not hard to see that the other terms up to degree two remain the same, and that 
although higher order terms are different they could be handled similarly as before. 
Hence we obtain 


Theorem 4.5 Let X* satisfy (4.6). We have 


AN 


Ex E) Jg? -4 с d 
Г /(Х®)уе FONE) е Ze Few |-— | (о» Faye’ ense 
E^ H 


+ O (gin ). 
Неге 


a = inf(F o Ф(Е) + 1/2144. k € Ha}, 
c=dF(@)Y, and d=dF(4)Gx, 


where Y and дк, satisfy 
dY (s) = xo (oi (s))Y (s)dy(s)+0:b(0, ó(s))ds--O;b(0, ó(s))Y(s)ds, Y(0) = 0, 


and 


Any (S) = 0xo(05)gi, s)d*s + DO, ó,)ds, дк, (0) = 0. 


Remark 4.6 "Theorem 4.2 for the rough case 1 < Н < 1 was proved by Inahama 
[27]. In this case, equation is understood in the rough path sense. Thanks to Propo- 
sition 2.3, equations for g; and R; are understood as Young's paring. 

In [27] the author also discussed RDEs with fractional orders of =, in which the 
index set A, was introduced. The main idea of the proof for the rough case is the same 
as that outlined above. But the major difficulty is to show that d 2Fo® (9) (kl, k?) is 
Hilbert-Schmidt. This is easier when Н > І, since in this case д; К (f, s) is integrable, 


and one can easily obtain a nice representation for dF o (y) (kl, k). 


4.2 Expansion of the Density Function 


Consider 


d t t 
x-sM[ vasi + | Vo(Xs)ds. (4.10) 
i=] 0 0 
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We are interested in studying the small-time asymptotic behavior of X;. It is clear 
that by the self-similarity of B, this is equivalent to studying the asymptotic behavior 
of ХТ (for small =) which satisfies 


d t | 1 t 
х=] усхоаві+ е? | Vo(Xs)ds. 
i=1 79 ” 


In what follows, we use the Laplace approximation to obtain a short time asymp- 
totic expansion for the density of ХТ in the case when Н > i. For this purpose, we 
need the following assumption. 


Assumption 4.7 e A 1: For every x € Rf, the vectors Vi (x), --- , Va(x) form a 
basis of R4. 
e A 2: There exist smooth and bounded functions и! j such that: 


d 
[Vi Vj] = Š oi; Vi 
l=1 


and 
А j 
ips =w]. 

Assumption A1 is the standard ellipticity condition. Due to the second assumption 
A2, the geodesics are easily described. If k : Ко — R is a o-Hólder path with 
a > 1/2 such that k(0) = 0, we denote by ®(x, k) the solution of the ordinary 
differential equation: 


d t 
xyoxt PM Vi(xs)dki. 
i=l 0 


Whenever there is no confusion, we always suppress the starting point x and denote 
it simply by ®(k) as before. Then we have (see Lemma 4.2 in [5]) 


Lemma 4.8 ®(x, k) is a geodesic if and only if k(t) = tu for some и € R¢. 


As a consequence of the previous lemma, we then have the following key result 
(Proposition 4.3 in [5]): 


Proposition 4.9 Let T > 0. For x, y € В“, 


Жн T Т?Н 


inf 
кє.#н,Фт(х,К)=у 
Lemma 4.10 For any x € Rf, there exists a neighborhood V of x and a bounded 
smooth function F(x, у, х) on V x V x R? such that: 
(1) For any (x, y) € V x V the infimum 
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d(x, z)? 
inf | F(x, y,z) + ee 


zem} =o 


is attained at the unique point y. Moreover, it is a non-degenerate minimum. 
Hence there exists a unique k? є Hy such that (a): Фу(хо, КО) = уо; (b): 
d(xo, yo) — LAPA ; and (c): k? is a non-degenerate minimum of the functional: 
k > F(®1(xo, k)) + 1/2112 „ оп Жн. 

(2) For each (x, y) € V x V, there exists a ball centered at y with radius ғ 
independent of x, y such that F(x, y, -) is a constant outside of the ball. 


Let F be in the above lemma and p-(x, y) the density function of X1. By the 
inversion of Fourier transformation we have 


_ Е(х,у,у) 1 has: ic PES Е(х,у,2) 
pex, yje 2 = fe ас fe е 2 pex, z)dz 
T 


(27)! 
1 MEL Te: _ FQ,y,z) 
= One е4 [ее è p,(x,z)dz 
1 | ic -» _ К@,у,Х\) 
= Qne)d dG ux Na £ e g . (4.11) 


Itis clear that by applying Laplace approximation to the expectation in the last equa- 
tion above and switching the order of integration (with respect to ¢) and summation, 
we obtain an asymptotic expansion for the density function pe(x, y). 


Remark 4.11 One might wonder why not constructing, for each fixed x, y, a function 
F which minimizes (at z — y) 


D(x, zy? 


F(x, y, 
(x, у, z) + 2 


in Lemma 4.10, where 


D^(x,y)— inf к. 
G5 y) key, (x, k)—y | EZ 


After all D(x, y) seems the natural “distance” for the system (4.10), instead of 
the Riemannian distance d (x, y). The problem with D(x, y) is that it is not clear 
weather it is differentiable, while the construction of F in Lemma 4.10 needs some 
differentiability of D(x, y). This is indeed one of the reasons why we impose the 
structure assumption A2 so that D(x, y) — d(x, y) (content of Proposition 4.9). 
With this identification, we know D(x, у) is smooth for all x Æ y. 


Remark 4.12 In order to show Proposition 4.9, we used the fact that OK (t, s)/Ot is 
integrable, which is only true for the smooth case Н > 1. Hence although Inahama 
proved the Laplace approximation for 1 <H< 5 in [27], we can not repeat the proof 
in this section to produce an expansion of the density function for the rough case. 
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Recall the definition of A; in Remark 4.4 and similarly set 


Ag = {к — 1|к € Ay\{O}} 


and 
А» = {к — 2|& € A1M0]J. 
Next define 
Аз = {ay +a +---+ay,|m € N4 and aj, ...,аһ € Az}. 
and 
A5 = {ay a2 +--+ +ay|m € Ny and aj, ..., am € A5]. 


Finally, set 
Ag = {а + bla € Аз, Бє A3] 


and denote by {0 = Ao < Ay < А№ < ...} all the elements of A4 in increas- 

ing order. Similar as before, powers of e in the index set A3 comes from the term 

exp [ic (XT - y)/e} in (4.11) and powers in A5 comes from exp{—F (x, y, X§)/e7}. 
Our main result of this section is the following (by letting = = t”). 

Theorem 4.13 Fix x € R^. Suppose the Assumption 4.7 is satisfied, then in a 


neighborhood V of x, the density function p(t; x, y) of X, in (4.10) has the following 
asymptotic expansion near t = 0 


N 
Sy). B 
a Qc (Sac. SP ruat, x, yan). y € V. 
ї=0 


р@; x, y= GHA 


Here В is some constant, d(x, y) is the Riemannian distance between x and y 
determined by V,,..., Va. Moreover, we can chose V such that cj(x, y) are C9? 
inVxVCR¢x В, and for all multi-indices о and В 


sup вир l0? OP rw (t, x, y)| « oo 
1<10 (x,y)eV xV 


for some to > 0. 


Remark 4.14 Differentiability of c; (x, y), ry+1 in the above theorem and legitimacy 
of Fourier inversion in (4.11) is obtained by Malliavin calculus and some uniform 
estimates of the coefficients in the Laplace approximation. We refer the reader to [5] 
for details. 


Remark 4.15 Our result assumes the ellipticity condition and a strong structure con- 
dition (Assumption 4.7). Later Inahama [28, 29] proved the kernel expansion (for 
Н > i ) under some mild conditions on the vector fields. He takes a different approach 
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and uses Watanabe distribution theory. Hence he is able to work with D(x, y) intro- 
duced in Remark 4.1 1 directly and avoids the technical assumption A2 of Assumption 
4.7. On the other hand, the smoothness of coefficient and the uniform estimate for 
the remainder terms in the expansion are not provided in [28, 29]. 


5 Application to Mathematical Finance 


Fractional Brownian motions have been used in financial models to introduce mem- 
ory. In this section, we give two examples of such models and remark on how the 
methods and results in the previous sections could be applied to the study of such 
models. 


5.1 One Dimensional Models 


Memories can be introduced to stock price process directly. In particular, the so-called 
fractional Black and Scholes model is given by 


2 
S, = So exp (v t oBE — z^ (5.1) 


where B is a fractional Brownian motion with Hurst parameter Н, и the mean rate 
of return and c > 0 the volatility. Let Бе the interest rate. The price for the risk-free 
bond is given by e''. 

More generally, one can also consider a fractional local volatility model 


d$; = S (udt + o(S;)d BE). 


Here the stochastic integration with respect to B could be understood in the sense 
of rough path theory. After a simple change of variable X, = log S;, one obtains 


dX, = pdt + c(e*dBP. 


There has been an intensive study recently of option prices and implied volatilities for 
options with short maturity (e.g. [9, 16, 21]). Since the above equation is a special 
case of (4.10), we can use the results obtained in the previous sections to obtain 
short-time asymptotic behavior of such models. 

A drawback of the finance models discussed above is that they lead to the existence 
of arbitrage opportunities. For example, let the couple (a;, 9;),t € [0,7] be a 
portfolio with о; the amount of bonds and /3; the amount of stocks at time г. One can 
construct an arbitrage in the fractional Black and Scholes model by (for simplicity, 
we assume и — r = 0) 
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t 
bi = S; — So, and o; =f Bid S; — 3 St. 
0 


Let V; be the value of the portfolio at time f. It is not hard to see that this is a self- 
financing portfolio that satisfies Vy = 0 and V; = (5; — So)? for all > 0, and hence 
it is an arbitrage. For more discussion on arbitrage in models given by fractional 
Brownian motions, we refer the reader to [35]. 


5.2 Stochastic Volatility Models 


Stochastic volatility models were introduced to capture both the volatility smile and 
the correct dynamics of the volatility smile (see [23] for instance). For these models, 
modeling the volatility process is one of the key factors. In [14], the authors proposed 
along memory specification of the volatility process in order to capture the steepness 
of long term volatility smiles without over increasing the short run persistence. 

The following stochastic volatility model based on the fractional Ornstein- 
Uhlenbeck process provides another way introducing long memory to the volatility 
process: 

dS, = uSi;dt + o: S;dW;, 


where c; = f (Y;) and Y; is a fractional Ornstein-Uhlenbeck process: 
dY, = o(m — Y,)dt + f d BP. 


In the above W; is a standard Brownian motion and в an independent (of W;) 


fractional Brownian motion with Hurst parameter Н > 1. Examples of functions f 
are f (x) = e* and f(x) = |х|. 

Comte and Renault [13] studied this type of stochastic volatility models which 
introduces long memory and mean reversion in the Hull and White setting [26]. The 
long memory property allows this model to capture the well-documented evidence of 
persistence of the stochastic feature of Black and Scholes implied volatilities when 
time to maturity increases. 

Unlike one dimensional models mentioned above, the fractional Ornstein- 
Uhlenbeck model is arbitrage free since the stock price process is driven by a standard 
Brownian motion. In [25], Hu has proved that for this model, market is incomplete 
and the martingale measures are not unique. If we set yy = (r — ш) /о; and 


dQ T Г 
"LES ( | аў - 5 | тш). 


Then Q is the minimal martingale measure associated with P. Moreover, the risk 
minimizing-hedging price at t = 0 of an European call option with payoff (Sr — K)* 
is given by 


On Small Time Asymptotics for Rough Differential Equations ... 437 


Co = e "TEQ(Sr — K)*. 


The fractional Ornstein-Uhlenbeck model takes a generalized form of Eq. (4.10) 


that is studied in the previous sections. It is a system of SDEs driven by fractional 
Brownian motions, but with varying Hurst parameter H . We believe that the methods 
discussed above can be extended to study small-time asymptotics of these models. 
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On Singularities in the Heston Model 


Vladimir Lucic 


Abstract In this note we provide characterization of the singularities of the 
Heston characteristic function. In particular, we show that all the singularities are 
pure imaginary. 


Keywords Heston : Complex singularities 


1 Problem Formulation 


Consider the Heston stochastic volatility model, which under risk-neutral measure 
and with zero drift has the following dynamics 


dS; = Si v awh”, 
dv, = А — vi) dt + n/v (аи) + V/1— р? dWy”), 
where the parameters А, 77, and v are nonnegative, p € [—1, 1], and the initial values 


So and vo are positive. 
The Heston characteristic function is defined as 


óu(u,T) = L [et 0868/50], а < S(u) < B. 


Results of Heston [5] and Lewis [7] show that on the strip of convergence œ < 
З(и) < 8 the Heston characteristic function coincides with 


ф(и, т) = eC Du T)vo. uc Z, 
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where 
Lae" 2 jg" 
DUT) -T-y ge Cu n Arr ma log ( pem |. 
+d 5 5 г 
re = р , d= 82 + 2am*, g= —, 
7] r4 
u? iu 
aem B — А+ piu. 


With the customary abuse of terminology, we'll refer to ф(и, т), и є Z as the Heston 
characteristic function. 

Using a result! of Lukacs [8], Lewis [7] points out that ф(и, т) has singularities 
on the imaginary axis at the boundaries of the strip of convergence. Whether there 
are any other singularities (necessarily complex-conjugate) on that boundary could 
not be readily established. Furthermore, no conclusions can be made about singular- 
ities outside of the strip of convergence. The purpose of this note is to provide full 
characterization of the singularities of ф(и, т). 


2 Main Result 


The following theorem, although presented as an existence result, allows for con- 
struction of the singularities of Ф(и, т) via standard numerical methods. 


Theorem 2.1 All singularities of ó(u, т) are pure imaginary. 
Proof Assume 7 > 0, as for 7 = 0 we have the Black-Scholes model whose char- 
acteristic function is free of singularities (see, e.g., Lewis [7]). 


To simplify notation we put is — u and show that the (essential) singularities of 
Ф(15, т) are real. To this end, we show that the transcendental equation 


SR Q.1) 
r= 
where 
б = А— pns (2.2a) 
d = f —7s(s — 1) (2.2Ь) 
г+ = В 7 2 (2.2с) 


has only real roots. 


lAs noted in Lukacs [8], this is a corollary of a more general result on Laplace transforms, e.g. 
Theorem II.5b of Widder [9]. 
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We consider (2.1) and (2.2) as a system in d and s. Equation (2.1) can be written as 
d = (—A + pns) tanh(rd/2). (2.3) 

From (2.2a) and (2.2b) we get 
=(1—= ГУ + 5002 — 2pnd) + X — d? = 0, (2.4) 


so, with q := y 1 — p*, we can express s in terms of d: for q #0 


п — 20A Æ y (n — 2p)? + Aq? (X2 — d?) 


$1/2 = 2421 (2.5) 
and for q = 0 and 2рл — 7 40 
d? = ^2 
Same (2.6) 
n? — 2Apn 


If q = 0 and 2pA — 7 = 0 from from (2.1), (2.2), and (2.4) we obtain d = A, p = 1, 
n = 2A, which implies that the only singularity is 


1 
БТ ША 


If d = 0 we have equality in (2.3), while from (2.5) and (2.6) it follows that the roots 
in s are real. 
For d Æ 0 substituting (2.5) in (2.3) yields 


— 2рА + y) — 2рА)2 + 442022 — a? 
i ма 120+ V0 2 +442( еа 
q 


while substituting (2.6) in (2.3) gives 


2 2 


d^ — А 
d= (^ + х) tanh(rd/2). 
п = 2pr 
which imply, respectively, 
2 
(2dq? coth(rd/2) + 2\ — pn) = p(n 2рА)2 + 44202 — a2), (277) 


and 
p 


no OF — 47). (2.8) 


d coth(rd/2) + A = 
п 
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With 
Td, т( — 2Ар) TÀ 
— —iz a= ————sgn(p), b= —, c= 
2 4q 2 q 
Lemma 2.2 implies that the roots of (2.7) are either real or pure imaginary. For the 
special case (2.8), Lemma 2.3 with 


Td, TX 2p 

— =. b= —, c= ——— 

2 2 T(N — 2pA) 

implies that the corresponding roots are also either real or pure imaginary. 
Therefore, it follows that for d # 0 the expression in the brackets in (2.3) is 

real (being ratio of either real or imaginary numbers), which in turn implies that the 

solutions of the transcendental equation (2.1) are real in s. 


Lemma 2.2 For real a and real nonnegative b, c the roots of the equation 
(zcot(z) +b — ac = (a? +b? + 12), 167 (2.9) 


are real or pure imaginary. 


Proof For c = 0 the result follows from Lemma А.б. If c > 0 from Lemma 2.4 we 
have that for sufficiently large N equation (2.9) has 4N + 2 roots inside the square 
with vertices (№ + 1/2) (т, ci). On the other hand, from Lemmas A.1 and A.3 
it follows that there are 4N + 2 real or pure imaginary roots inside the same square, 
so the result follows. 


Lemma 2.3 For real nonnegative b and real c the roots of the equation 
zcot(z) +b —c( +27), zeZ (2.10) 


are real or pure imaginary. 


Proof For c — 0 the result follows from Lemma A.6. Putting a — 0 in Lemma 2.4 
we conclude that for every c Æ 0 and sufficiently large № equation 


(ссох) -b 2? (0 +27), сє? 


has 4N + 4 roots inside the square with vertices (№ + 1/2) (2-7, ci). On the other 
hand, from Lemmas A.2 and A.4 it follows that both equations 


zcot(z) +b = +c(b? + 22) 


have 2N + 2 real or pure imaginary roots inside the same square, whence the result 
follows. 
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In the next lemma? 


[Theorem 9.2.3]). 


we make repeated use of the Rouché's theorem (e.g., Hille [6] 


Lemma 2.4 Let Cy, N € N denote the square in complex plane with vertices at 
(N + iX т, xim). Then for real a, nonnegative b, c, and d = 1,2 there exists 
No € N such that for every integer N > No the equation 


(zcot(z) +b — ас)? = ca? +b? + 2y!, тє? (2.11) 


has 4N + 2d roots inside Cy. 


Proof Consider the case d = 1, с > 1 and the case d = 2, c > 0 together. On the 
right vertical side of Cy we have 


У ету 


|cot(z)| = eo (Z + Nx iy) = |tan(iy)| = <1, (2.12) 


ey Fe 
while on the upper horizontal side we have 


1 + e N+) g2ix 14 e- QN- Dr 


< 
= 1 e- CNH)" 


212 
ет +1 
| cot(z)| = | = 


e2iz = 1 


T= е-@М+1)т2їх 


Together with (2.12) and the fact that | cot(z)| = | cot(—z)| this implies 


4 e- QN-Dv 


| cot(z)| < 


SiL aN kw, 2 € Cw, 


For z € Cy have 


I cot(z) +b – ас)?| _ (|zcot(z)| + |b — acl" 
[с2(а2 + D? + z2)|4 — Ic? (a? + b? + z2)|d 


2 
P (= А m) 
с [ez] 
Since т,» oo kn = 1, the last expression tends to (2 — d)/ с2 <1 uniformly in z as 
N — оо, so for sufficiently large № we have 


г? 


(a? + b2 + 22)4 


\(zcot(z) +b — ac)?| < |? (a? +b Ez), z e Cy. 


Therefore, by Rouché's theorem the number of roots of (2.11) inside Cy is equal 
to the number of poles of z к> (zcot(z) +b — ас)? — c?(a? + D? + z2y! inside 
Cy plus the number of zeros of z — ca +b + zd inside Cy (considering 


2А weaker version of this result (dealing with the case of real roots only) appears as Problem E1295 
in American Mathematical Monthly, Vol. 65., No. 6, p. 450. 
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their multiplicities). For sufficiently large N those two numbers are 4N and 2d 
respectively, whence the equation (2.11) has 4N + 2d roots inside Cy. 

Consider now d = 1,0 < c < 1. Let Dy be the square vertices at (CEN 7, 3 Ni), 
and let Di, denote Dy extended with semicircles of radius є so that the poles of 
cot(z) at +N7 are inside D‘,, but the real zeros of (2.11) in (Nz, (N + 1/2)7) 
and (—(N + 1/2)т, —N) described in Lemma А.І remain outside. For ease of 
exposition in what follows we make є smaller if necessary, which can be done without 
invalidating previously established statements. 

Similarly as before, on the right vertical side of Dy we have 


У ету 
| tan(z)| = |tan (Мт + iy)| = |tanGy)| = [2—1 < 1, (2.13) 
e re 
while on the upper horizontal side we have 
2iz _ 1 1— e ?Nn g2ix 1+ e 2Nv 
| tan(2)| = |257 +1 | = | 1+ e-2NTghix| ^ р е 2м (2.14 
Together with (2.13) and the fact that | cot(z)| = |cot(—z)| this implies that for 
sufficiently small e > 0 
_ e 2Nv 
| cot(z)| > I4 e 2їт Skp ZE Dy. (2.15) 
On Dj, we have 
|c? (a? + b? + 22)| К 2 " Ic? (a? + Ь2)| 
|(z cot(z) + b — ac)?| ^ (сока |2 ) (izil сос) — |b — acl) 
so for N large enough 
Ic (a? + Б? + у] с? Ic? (a? + Ь?)| 


|(z cot(z) + b — ас)2| 7 (kw _ == 


ү (12у = |b — ас)" 


Since іт,» оо kn = 1, the last expression tends to с2 <1 uniformly in z as N — oo, 
so for sufficiently large N we have 


le^ (a? +b? + 22)| < Kzcot(z) +b —acY|, ze D$. 


Therefore, by Rouché's theorem the number of roots of (2.11) inside D$, is equal 
to the number of poles of z — (z cot(z) + b — ас)? — c?(a? + D? + 22) inside Dy 
plus the number of zeros minus the number of poles of z œ> (z cot(z) + b — ас)? 
inside D4, (considering their multiplicities). The two mappings have common poles, 
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so we are left with number of zeros of the second mapping, which for sufficiently 
small e, according to Lemma A.6, is 4N. Therefore, from Lemma A.5, and taking 
into account two real zeros in (C(N + 1/2)z, Мт) О (Мт, (N + 1/2)т) whose 
existence is established in Lemma A.1, we conclude that for d = 1, 0 < c < 1 and 
sufficiently large N there are 4N + 2 zeros inside Cy. 
Finally, consider the case c = 1, d = 1. Puta = b — ac, 8? = a? + Ь2, so that 
we get 
cos(2z) > sin(2z) 2 
2 az — 
sin?(z) sin?(z) 


— a2) = 0, (2.16) 


or, equivalently, 
2 cot(2z) cot(z)z? + 2az cot(z) — (3° — а?) = 0. 


On Оу we have 


= Be 18202] 
|202 cot(z) — (82 — о2)| E 2а 20010) | | 22| + ЕТЕУ 
|2 cot(2z) cot(z)z2) ^^ 2l|z||cot(22)) ^  2lz|kow 


Since іт, оо kn = 1, the last expression tends to zero uniformly in z as № — oo, 
so for sufficiently large N we obtain 


|202 cot(z) — (8? — a*)| < |2cot(2z) cot(z)z*|, z € D$, 


that is, 
cos(2z) > 


2... „2 € 
um (8 a^), «єр. 


| sin(2z) 
> ac Se, T 
sin (2) 


Thus, by Rouché's theorem this implies that the number of roots of (2.16) inside Dj, 
equals the number of zeros of z > 581012 inside Ру, which is 4N. Therefore, 


reasoning as in the previous part of the proof we conclude that for c — 1 we have 
4N -F 2 zeros of (2.11) inside Cy for N large enough. 


Acknowledgments I wish to thank Tomislav Sekara of University of Belgrade and the anonymous 
referee for their comments and suggestions. 


Addendum 


The first version of this paper appeared on SSRN in 2007 (following the author's 
investigation into applicability of the Talbot's numerical inversion method in trans- 
form analysis of option prices). Since then several publications have appeared using 
the main result of the present work, which we list below for completeness. 
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Based on Theorem 2.1, in Ferreiro-Castilla [2] and del Baño Rollin et al. [1] a 
smoothness result for the density of the log-spot in the Heston model is presented, 
together with an alternative proof of our main result. Theorem 2.1 was also used in 
Friz et al. [3] in the study of the asymptotic behaviour of the stock price density in 
the negatively correlated Heston model. Finally, Lemma 6.1 from Gulisashvili et al. 
[4], used in the study of the asymptotic behaviour of the mixing distribution density 
in the uncorrelated Heston model, is quite close in spirit to the results presented here. 


Appendix 
Lemma A.1 For № sufficiently large, equation (2.9) has 4N —2 real roots in (—(N + 
1/2)т, =r) О (т, CN + 1/2)2). 


Proof By Lemma A.5 for every N > 1 equation (2.9) as two real roots in each of 
the intervals (—(k + 1)z, —kr) and (Кт, (k + 1) п), к= 1,2,...N — 1. 
Rewrite (2.9) as 


zcot(z) = —(b — ac) + cv a2? + P? + 22. (A.1) 


Fo N > 0 
lim zcot(N7)=+00, zcot(Nm + 7/2) = 0, (A.2) 
z> Nat 


so we conclude that for sufficiently large N the equation with plus sign has one real 
root in (Nz, (№ + 1/2)7), hence by symmetry in (—(N + 1/2)т, —N). 


Lemma A.2 For N sufficiently large equation (2.10) has 2N real roots in (—(N + 
1/2)т,—т) О (п, (N + 1/2)2) if c > 0, and 2N — 2 real roots if c < 0. 


Proof By Lemma A.5 for every N > 1 equation (2.10) has one real root in each of 
the intervals (—(k + 1)z, —kr) and (Кт, (k + Dy), к= 1,2,...N — 1. 
Rewrite (2.10) as 
zcot(z) = —b + c(D? + 22). (A.3) 


From (A.2) and (A.3) we conclude that if c > 0 for sufficiently large N equa- 
tion (2.10) has one real root in (Мт, (N + 1/2)7), hence by symmetry in (—(N + 
1/2), — NT). 


Lemma A.3 For real a, nonnegative b, and c > 0 equation (2.9) has either four 
real roots in (—т, т), or two real roots in (—7, т) and two imaginary roots. 


Proof The proof follows by simple geometrical considerations. For z = 0 the right- 
hand side of (A.1) assumes two values 


ay := —(b— ac) + cv a? +2, a4: —(b — ac) — суа? + b?. 
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Since ac — cv b? + a? < 0 we have a2 < 0. On the other hand, the function 
x e x cot(x) is zero at the origin and strictly decreases on [0, 7), with a discontinuity 
of the second kind at 7. Thus, (A.1) has one real root corresponding to the intersection 
of x > x cot(x) and x  —(b — ac) — cv a? + b? + z? on (0, п). 

If o, < 1 following the same argument we conclude that there is another real root 
in (0, 7) corresponding to the intersection of x > x cot(x) and x œ> —(b — ac) + 
сма? + b? + z?. If ay = 1 we have a double root at zero. 

Thus, based on the above considerations and the symmetry around the origin it 
follows that in (—7, т) equation (A.1) has four real roots if a; < 1, and two real roots 
if o, > 1. Therefore, to complete the proof we show that (A.1) has two imaginary 
roots if o4 > 1. 

Putz = iy, y e R in (А.1) to get 


ycoth(y) = —(b — ac) + ca? + b? — y?. (A.4) 


On the left-hand side we have a continuous function equal to one at the origin that 
tends to infinity as y increases. Note that a; > 1 implies a? + b? > 0 Thus, on 
the right-hand side we have a semi-circle starting at (0, о) on the ordinate, entering 
into the right half-plane, and ending at (0, a2) on the ordinate, half-encircling the 
point (0, 1) (as ay > 1 and a2 < 0). Therefore, there must exist yo > 0 for which 
the equality holds in (A.4). Since —yo also solves (A.4), we have two imaginary 
solutions. 


Lemma A.4 Assume b > 0. For c > 0 equation (2.10) has either two real roots 
іп (—т,т) or two imaginary roots. If c < 0 equation (2.10) has two real roots in 
(—7, т) and two imaginary roots. 


Proof At z = 0 the right-hand side of (A.1) equals —b + cb?. The function x > 
x cot(x) is zero at the origin and strictly decreases on [0, 7), with a discontinuity of 
the second kind at 7. Thus, if c < 0 or c > 0 and —b + cb? < 1 there is one real 
root in (0, 7), hence by symmetry in (—7, 0). If —b + cb? = 1 we have a double 
root at the origin. Next, with z — iy, y € IR equation (2.10) becomes 


ycoth(y) 2 —b 4- c(D? — y?). (A.5) 


Therefore, if c > 0 and —b+cb* > 1 the right-hand side dominates the left-hand side 
atthe origin, while the opposite is true for sufficiently large y. From the continuity of 
the two functions it then follows that (A.5) has one positive root, hence by symmetry 
one negative root. Finally, ifc < 0 the left-hand side dominates the right-hand side at 
the origin, while the opposite is true for sufficiently large y, giving a pair of imaginary 
roots. 


Lemma A.5 For every positive integer k equation (2.11) has two real roots in each 
of the intervals (—(k + 1)т,—Кт) and (km, (k + 1)7). 
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Proof The result follows from the fact that on each of those intervals the range of 
the map x +> x cot(x) is the whole real line, while the maps x — —b+ac+ c(a? T 
b? + х2)4/2 are bounded. 


Lemma A.6 Fora € IR the equation 


zcot(z)=a, ZEZ (A.6) 


has 2N roots inside the square with vertices (Na, X: Nim). The roots are real or 
pure imaginary. 


Proof For a = 0 the roots are the zeros of cos(z). If a = О from (2.13) and (2.14) 
we conclude that for sufficiently large 


latan(z)| < |2) z€ Dy. 


Thus, by Rouché's theorem 
z = a tan(z) 


has 2N + 1 roots inside the square with vertices (CEN 7, +Nir). If k > 0 it has two 
real roots in (—(k + 1)т/2,Кт/2) О (kr/2, (k + 1)7/2) if either a > О and k is 
even, ora < 0 and k is odd. 

On the other hand, in (—7, 7) there are three roots (counting their multiplicities) 
ifa > 1 апа one root if 0 < a < 1. In the latter case there are two imaginary roots 
(c.f. example on p. 255 of Hille [6]). Since (A.6) has one root less at the origin, the 
result follows. 
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On the Probability Density Function 
of Baskets 


Christian Bayer, Peter K. Friz and Peter Laurence 


Abstract The state price density of a basket, even under uncorrelated Black-Scholes 
dynamics, does not allow for a closed form density. (This may be rephrased as state- 
ment on the sum of lognormals and is especially annoying for such are used most 
frequently in Financial and Actuarial Mathematics.) In this note we discuss short 
time and small volatility expansions, respectively. The method works for general 
multi-factor models with correlations and leads to the analysis of a system of ordi- 
nary (Hamiltonian) differential equations. Surprisingly perhaps, even in two asset 
Black-Scholes situation (with its flat geometry), the expansion can degenerate at 
a critical (basket) strike level; a phenomena which seems to have gone unnoticed 
in the literature to date. Explicit computations relate this to a phase transition from 
a unique to more than one “most-likely” paths (along which the diffusion, if suit- 
ably conditioned, concentrates in the afore-mentioned regimes). This also provides a 
(quantifiable) understanding of how precisely a presently out-of-money basket option 
may still end up in-the-money. 
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1 Introduction 


As is well known, the sum of independent log-normal variable does not admit a 
closed-form density. And yet, there are countless applications in Finance and Actu- 
arial Mathematics where such sums play a crucial role, consider for instance the law 
of a Black-Scholes basket B at time Т, i.e. the weighted average of d geometric 
Brownian motions. 

As a consequence, there is a natural interest in approximations and expansions, 
see e.g. [9] and the references therein. This article contains a detailed investigation 
in small volatility and short time regimes. Forthcoming work of A. Gulisashvili 
and P. Tankov [12] deals with tail asymptotics. Our methods are not restricted to 
the geometric Brownian motion case: in principle, each Black-Scholes component 
could be replaced by the asset price in a stochastic volatility model, such as the the 
Stein-Stein model [16], with full correlation between all assets and their volatilities. 
In the end, explicit solutions only depend on the analytical tractability of a system of 
ordinary differential equations. If such tractability is not given, one can still proceed 
with numerical ODE solvers. 

As a matter of fact, our aim here is not to push the generality in which our methods 
work: one can and should expect involved answers in complicated models. Rather, our 
main—and somewhat surprising—insight is that unexpected phenomena are already 
present in the simplest possible setting: to this end, our first focus will be on the 
case of d — 2 independent Black-Scholes assets, without drift and correlation, with 
unit spot and unit volatility). To be more specific, if Св denotes the fair value of an 
(out-of-money) call option on the basket B struck at K , one naturally expects, for a 
small maturity Т, 


2 


SE (K, T) ^ (const) exp (- 


A(K)\ 1 
n T) 


г — 


JF 


And yet, while true for most strikes, it fails for К = K*; in fact, 


2 
|a» (K, n] 


А к) 1 
ӘК? 


^ (const) exp (- Т TÁC 


К=К* 


To the best of our knowledge, and despite the seeming triviality of the situation (two 
independent Black-Scholes assets!), the existence of a “special” strike level K*, at 
which the value of a basket option (here: butterfly spread!) has a "special" decay 
behavior, as maturity approaches 0, seems to be new. There are different proofs 
of this fact; the most elementary argument—based on the analysis of a convolu- 
tion integral—is given in Sect.2. However, this approach—while telling us what 
happens—does not tell us how it happens. 

The main contribution of this note is precisely a good understanding of the latter. 
In fact, there is clear picture that comes with K*. For K « K* and conditional on the 


! Extensions to spreads and vanilla options are possible and will be discussed elsewhere. 
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option to expire on the money, there is a unique “most likely" path around which the 
underlying asset price process will concentrate as maturity approaches 0. For K > 
K*, however, this ceases to be true: there will be two distinct (here: equally likely) 
paths around which concentration occurs. What underlies this interpretation is that 
large deviation theory not only characterizes the probability of unlikely events (such 
as expiration in-the-money, if presently out-of-the-money, as time to maturity goes to 
Zero) but also the mechanism via which these events can occur. Such understanding 
was already crucial in previous works on baskets aiming at quantification of basket 
(implied vol) skew relative to its components, starting with [1, 2]. As a matter of 
fact, the analysis in these papers relied on the statement that “generically there is 
a unique arrival point (of a unique energy minimizing path) on the (basket-strike) 
arrival manifold". The situation, however, even in the Black-Scholes model, is more 
involved. And indeed, we shall establish existence of a critical strike K *, at which 
one sees the phase-transition from one to two energy minimizing, “most likely", 
paths.? And this information will have meaning to traders (as long as they believe in 
a diffusion model as maturity approaches 0, which may or may not be a good idea 
...) as it tells them the possible scenarios in which an out-of-the money basket option 
may still expire in the money. 

Let us conclude this introduction with a few technical notes. We view the evolution 
of the basket price—even in the Black-Scholes model—as a stochastic volatility 
evolution model; by which we mean d B;/ B; = o (t, w)dW; (as opposed to a local 
vol evolution where с = o (t, B;)). This should explain why the methods developed 
in Part I of [6, 7] for the analysis of stochastic volatility models (then used in Part 
II, [7], to solve the concrete smile problem (shape of the wings) for the correlated 
Stein-Stein model), are also adequate for the analysis of baskets. 


2 Computations Based on Saddle-Point Method 


In terms of a standard d-dimensional Wiener process (wt, — w^), 


d 
Вт = > 5уехр (uir + c'wj) , 
i=l 
Write f = fr (К) for the probability density function of Вт; i.e. for P [Вт € [K, К 


+dK]]/dK. Of course, it is given by some (d — 1)-dimensional convolution 
integral, explicit asymptotic expansions are—in principle—possible with the 


?]t can be shown that, sufficiently close to the arrival manifold, there is in fact a unique energy 
minimizing paths. The (near-the-money) analysis of [1, 2] is then justified. 
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saddle point method. It will be enough for our purposes to illustrate the method 
in the afore-mentioned simplest possible setting: 


d=2, S) = $ = 1, u! = u? =0, o' =0° 21. 


In other words, Br = exp (Wi) + exp (№2). We claim that for some constant 
co = co(K) > 0 


exp (-459) -l(co-FO (T),  whenK £ K*, (1а) 
f (K) = kt (1) 
exp(- 562) ta (co + O(T)), when K = к", (1b) 
with 
K* = 2e © 5.43656 
and 
A(K) = inf{hx(x) |x € [0, К]) 
with 
hx (x) := (logx)? + (log(K — х))?. (2) 


Note that for K < K* we can explicitly solve this minimization problem and obtain 
A(K) = log(K/2)? with corresponding minimizer x* = K/2, corresponding to 
the single local extremum of hx. For K > K*, we have two global minima, which 
cannot be given in closed form, and hence A(K) can only be computed numerically 


(Fig. 1). 


(a) (b) (c) 

e +] e 4] | 2-4 

о 4 ao ~ ao У] 

о У о = о 7 

* У + 4 + 4 

(QN 4 сч —] N ч 

о d о + о 4 
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hx for K < K* hx for K = K* hx for K > К“ 


Fig. 1 Plot of x for different choices of К.а For К < K* there is a unique global minimum at 
x* = K/2 which is non-degenerate in the sense that A" (х*) > 0. b For K = K* there is a unique 
global minimum at x = K/2 which is degenerate in the sense that A"(x*) = 0. c For К > K*, 
x — K/2 gives alocal maximum. There are two symmetric global minimizers, which are not given 
in closed form 
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The stock price S has a log-normal distribution with parameters и! = 0 and 
Еі = go /T = JT, where the density of the log-normal distribution is given by 
filie dois gr (3) 
х) = ех Я 
ps Jp. 222 


Obviously, the density of the sum of these two independent log-normal random 
variables satisfies 


K 
f(K) = | лн — x) fu  p00dx. (4) 
б 
Using our special parameters, the integrand is of the form 


1 һк(х) 
Ful (К — х) fu 20) = TaK — 3) exp ( T ) ; 


In order to apply ће Laplace approximation to (4), we compute ће minimizer for 
hx, which is found by the first order condition 


logx log(K—x) | 0 


h’, = 0 
Ka) md K—x 


(5) 
Clearly, this equation is solved by choosing x* = K /2—which is the unique global 
minimizer iff K < 2e and a local maximizer otherwise, in which case we have two 
global minima xf < K/2 < х5. Assuming K < 2e, we can check degeneracy of 
that minimum directly by computing 


1 — log(K /2) 
hie (х*) = ҺЪ(К/2) = 16-0800, 
апа 
hL(x*) = 0 «= К = 2е. (6) 


With more work one can see that also the global minima Xis х5 „in the case К > 2e, 
are non-degenerate. Hence, whenever К 5 2e a standard Laplace method leads to 
the expansion (1)a. In the remainder of this section, we consider the degenerate case 
and establish (1)b. 
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Choosing K — 2e and, correspondingly, x* — e, we obtain the Taylor expansion 
A) (i 
к (х) 


hk (x) = hra) +5 (х—х*)#+О((х—х*)5), with hg (x*) = 2and AP (x*) = 
20e^^, we obtain the Laplace approximation 


n- Ы 1 һк(х) 4 
FC у= [| zr 2T ) 8 


1 K 1 5e74(x — K /2)* 
= sa | exp ( т) exp ( ТЭТ Ja (1 4-O(T)) 


3!^r (1/4) 1 1 
= Уу вя exp ( =) "EL (1+O(T)), 


where we used 
ü 4 Г(1/4) 
exp(—ox )dx = 5 га > 0. 
Же 2a 


Thus, we arrive at (1)b. 


3 Large Deviations Approach 


Our main tool here are novel marginal density expansions in small-noise regime [6]. 
This was used in order to compute the large-strike behavior of implied volatility in 
the correlated Stein-Stein model; [11, 16]? 

In fact, the technical assumptions of [6] were satisfied in the analysis of the Stein— 
Stein model whereas in the (seemingly) trivial case of two IID Black-Scholes assets, 
the technical assumptions of [6] are indeed violated for a critical strike К = К“. 
The necessity of this condition is then highlighted by the fact, as was seen in the 
previous section, 


2 
| sac (K, n) 


A(K*)\ 1 
aK2 ) 


^» (const) exp (- T TUT 


К=К* 


The computation of K * can be achieved either via a geometric construction borrowed 
from Riemannian geometry, which relies on the Weingarten map, or by some (fairly) 
elementary analysis of a system of Hamiltonian ODEs. In fact, the Hamiltonian 
point of view extends naturally when one introduces correlation, local and even 
stochastic volatility. Explicit answers then depend on the analytical tractability of 
these (boundary value) ODE problems. (Of course, the numerical solution of such 
problems is well-known.) 


3Similar investigations have recently been conducted in the Heston model; [10, 13] and the refer- 
ences therein. 
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Inthe following, we review [6]. Consider a d-dimensional diffusion (Xf ) 
by the stochastic differential equation 


0 given 


dX? = b (e, X?) dt + eo (X*)dW;, with X§ = xf € R^, (7) 
and where W — (W!, ..., W™) is an m-dimensional Brownian motion. Unless 
otherwise stated, we assume b : [0, 1) x R? > Rf, о = (01,...,оһ) : ВА > 
Lin (R", к“) and x, : [0, 1) > IR? to be smooth, bounded with bounded derivatives 
of all orders. Set оо = b (0, -) and assume that, for every multi-index o, the drift 
vector fields b (=, -) converges to oo in the sense^ 

ƏŽb (e, -) > 08 (0, -) = де оуу (-) uniformly on compacts as ғ | 0. (8) 
We shall also assume that 


дер (е, -) — deb (0, -) uniformly on compacts as = | 0 (9) 


and 
хр = Xo + eĝo +o (e) ase | 0. (10) 


Theorem 1 (Small noise) Let (X^) ре the solution process to 


ах? = b (s, Xf) dt + во (Xf) AW, with X§ = x§ € R^. 


Assume b (£,-) — oo () in the sense of (8), (9), and Хо = xy > xo ase > 0 
in the sense of (10). Assume non-degeneracy of с in the sense that o.o" is strictly 
positive definite everywhere in space.’ Fix y € Ҝ!, Ny := (y, ) and let Ky be the 
the space of all h є Н, the Cameron-Martin space of absolutely continuous paths 
with derivatives in L? ([0, T], R”), s.t. the solution to 


ag? = oo (дї) а +$ (99) dti. ө ovo e t 


ї=1 


satisfies on € Ny. In a neighborhood of y, assume smoothness of © 


EM E! 
А (у) = int (5101 :h ex. 


^f (7) is understood in Stratonovich sense, so that dW is replaced by od W, the drift vector field 
b (e, -) is changed to Ё (e, -) = b (e, -) — (22/2) У" ої: до. In particular, oo is also the limit of 
b (e, -) in the sense of (8). 

>This may be relaxed to a weak Hoermander condition with an explicit controllability condition. 
буғ #K™"" = 1 smoothness of the energy can be shown and need not be assumed; [6]. Note also 
that in our application to tail asymptotics, with 0-scaling, 0 є (1, 2}, the energy must be linear resp. 
quadratic (by scaling) and hence smooth. 
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Assume also (i) there are only finitely many minimizers, i.e. кү" < оо where 
qin К " 1 2 ; 
y= ho є Ky : 2 Воїн = А (У) : 


(ii) xo is non-focal for Ny in the sense of [6]. (We shall review below how to check 
this.) Then there exists со = co (хо, y, T) > 0 such that 


Y$ = ПХ = (xs, .... x57). 1 «I «d, 


admits a density with expansion 


A(y) max{ A’ (у). Yr (һо) оек!) 
E 


fy T)=e e 


€ ‘(co + О (&)) ase | 0, 


where A’ denotes the gradient of A. 
HereY =Y (ho) = hae sida Ў!) is the projection, Y = ПХ, of the solution to 
the following (ordinary) differential equation 


dX, = (a.b (0. 4" ко) + «e (Ø Goo (0 Ridt + dsb (0, фу° схо) а, (11) 


Xo = Хо. 


Remark 2 (Localization) The assumptions on the coefficients b, с in Theorem 1 
(smooth, bounded with bounded derivatives of all orders) are typical in this context 
(cf. Ben Arous [3, 4] for instance) but rarely met in practical examples from finance. 
This difficulty can be resolved by a suitable localization. For instance, as detailed in 
[6], an estimate of the form 


lim lim sup &?logP [rg < T] = —оо. (12) 


R—oo £0 


with tr :— inf {t € [0, Т]: ѕир, ср |х| > R} will allow to bypass the bounded- 
ness assumptions. 


3.1 Short Time Asymptotics 


The reduction of short time expansions to small noise expansions by Brownian 
scaling is classical. In the present context, we have the following statement, taken 
from [6, Sect. 2.1]. 
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Corollary 3 (Short time) Consider dX; = b (Xj) dt + o (Х,) dW, started at Хо = 
xo € В“, with C®-bounded vector fields which are non-degenerate in the sense 
that o.o" is strictly positive definite everywhere in space. Fix y € Rl, Ny := (y, -) 
and assume (i), (ii) as in Theorem 1. Let f (t,-) = f (t, y) be the density of Y, = 
(X1, ..., ХІ). Then 


2 
fy (сопа) exp ( а = 2) aito 


where d (хо, y) is the sub-Riemannian distance, based on (01, ..., Om), from the 
point xo to the affine subspace Ny. 


3.2 Computational Aspects 


We present here the mechanics of the actual computations, in the spirit of the Pon- 
tryagin maximum principle (e.g. [15]). For details we refer to [6]. 


e The Hamiltonian. Based on the SDE (7), with diffusion vector fields o1, ...,0; 
and drift vector field oo (in the = — 0 limit) we define the Hamiltonian 


m 


1 
H (x, p) := (р, o0 00) + 7 У, (p. о (к)? 


ї=1 


= (p, оо 09) + $ (р, (007) бор). 


Remark the driving Brownian motions W!...., W” were assumed to be inde- 
pendent. Many stochastic models, notably in finance, are written in terms of 
correlated Brownian motions, i.e. with a non-trivial correlation matrix Q = 
(o^! TSJE m), where d (W', wi), = w!/dt. The Hamiltonian then 
becomes 


Нор) = (р, оо (9) + 5 (p. (0207) оор). (13) 
e The Hamiltonian ODEs. The following system of ordinary differential equations, 
Gej- dpH (x(t), p (1) ) m 
p) —dxH (x (t) , pH) 
gives rise to a solution flow, denoted by Н, «о, so that 


Н, „0 (хо, po) 
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is the unique solution to the above ODE with initial data (xo, po). Our standing 
(regularity) assumption are more than enough to guarantee uniqueness and local 
ODE existence. As in [5, p. 37], the vector field (dH, —9,7H) is complete, i.e., 
one has global existence. It can be useful to start the flow backwards with time-T 
terminal data, say (xr, pr); we then write 


Hir (Xr. pr) 


for the unique solution to (14) with given time-7 terminal data. Of course, 


H;—7 (Hr.-0 (хо, ро)) = Hro (хо, ро). 


Solving the Hamiltonian ODEs as boundary value problem. Given the target 
manifold Ма = (a, -), the analysis in [6] requires solving the Hamiltonian ODEs 
(14) with mixed initial-, terminal—and transversality conditions, 


x (0) = xo € R4, 


x(T) = (y, ) e R' RI, (15) 
p(T)2(,0) є В! e R”. 


Note that this is a 2d-dimensional system of ordinary differential equations, subject 
to 4+1-+Е (d — 1) = 2d conditions. In general, boundary problems for such ODEs 
may have more than one, exactly one or no solution. In the present setting, there 
will always be one or more than one solution. After all, we know by [6] that there 
exists at least one minimizing control ho and that can be reconstructed via the 
solution of the Hamiltonian ODEs, as explained in the following step. 

Finding the minimizing controls. The Hamiltonian ODEs, as boundary value 
problem, are effectively first order conditions (for minimality) and thus yield can- 
didates for the minimizing control ho = ho (-), given by 


| (о (х C). PO) 
= u | (16) 
(om (x ) PO) 


Each such candidate is indeed admissible in the sense һу € Ka but may fail 
to be a minimizer. We thus compute the energy 1012; = (хо, po) for each 
candidate and identify those (“ho € Keim») with minimal energy. The procedure 
via Hamiltonian flows also yields a unique po = po (ho). If оо = 0—as in our 
case—the energy is equal to H(xo, po), otherwise the formula is slightly more 
complicated. 

Checking non-focality. By definition [6], xo is non-focal for № = (у, -) along 
ho € Ж!" in the sense that, with (xr. pr) := Hr<o (хо, po (ho)) € TR, 
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0 
(5.9) (4,9) —(0,0)7t Ho—T (x + (°) ‚рт + (9, o) 


is non-degenerate (as d x d matrix; here we think of (3, 9) € R^! x R! = В 
and recall that z denotes the projection from 7*IR7 onto Rf; in coordinates 
л (X, p) = x). Note that in the point-point setting, xr = y is fixed and only pertur- 
bations of the arrival “velocity” pr —without restrictions, i.e. without transversal- 
ity condition—are considered. Non-degeneracy of the resulting map should then 
be called non-conjugacy (between two points; here: xr and xo). In the absence of 
the drift vector field oo, this is consistent with the usual meaning of non-conjugacy; 
after identifying tangent- and cotangent-space д;| 4-07 Ho.— 7 is precisely the dif- 
ferential of the exponential map. 
e The explicit marginal density expansion. We then have 


fey, T) = е 79/8/81 (co + О (єў) as e | 0. 


with c; = A (у). The second-order exponential constant c? then requires the 
solution of a finitely many ( #0" < oo) auxiliary ODEs, cf. Theorem 1. 


4 Analysis of the Black-Scholes Basket 


For a general multi-dimensional Black-Scholes model, we have a Hamiltonian 


1 
HG. р) = 5 (р, (o(x)20(x)")p), 


with o (x) = (o!xl,..., 0" x"), While the corresponding Hamiltonian ODEs can 
be solved in closed form, the boundary conditions lead to systems of non-linear 
equations, which we cannot solve explicitly any more. While numerical solutions 
are, of course, possible, we restrict ourselves to the extremely simple setting of 
Sect. 2, in order to keep maximal tractability. 

Consequently, we have the Hamiltonian H(x, р) = 4 ((ox! pl? + (ox? p?y)). 
The solutions of the Hamiltonian ODEs started at (xo, po) satisfy 


2.151 
xde” Xo Pot 


xer orit 

Н, —0(X0, ро) = 2 (17) 
1 „=о2х1ріг 
роё Xo Po 


pee? % Pot 
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which can be easily seen from the observation that H is constant along solutions of 
the Hamiltonian ODEs together with symmetry between (x!, p!) and (x?, p?). This 
immediately implies that the inverse flow is given by 


2.0.51 
ape X pt 


Ho Gi, Pr) = (18) 


Now we introduce the boundary conditions. Note that, contrary to Theorem 1, we 
now project to the linear subspace [x : х! + x? = K}. Thus, the terminal condition 
on x translates into xi + a = К —we need to end at the target manifold—, whereas 
the transversality condition translates to pr being orthogonal to the target manifold. 
Evaluating these conditions at T — 1, we get 


xb = 53 = 1, 
xg = 2 = 1, 
2 
1 2 
Pi — pj = 0 
It is a pleasant exercise to check that solving for x i =: x and x? = K — x then 


leads exactly to the first order condition (5) encountered in Sect.2. With identical 
arguments, assuming K < 2e from here on (and disregarding the case K > 2e where 
closed form computations are not available), we find that the optimal configuration 
must satisfy xf = (K/2, K /2). Inserting this value into the first two components 
of (17), we obtain the equation 


í = e? P «= р, = log (5) lan [== 1,2, 


This implies that рї = (3x log(K /2), E log(K /2)). Moreover, we see that the 


minimizing control satisfies 


hi Gea E Ea (19) 
00) = Din? 2 log(K/2) J^ 
ox (t) p*(t) срд DEA 
see (16), implying that the minimal energy is given by 


_ log(K/2? 


1 
MK) = 5 llholliy = —3— = (хо, ро). (20) 
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Regarding focality, we have to check that the matrix: 


&|, Hott e. 7D. р) d, „НД уба, pi n0.) 
M(x], pi) := | 
&| _„Нф би +e, 0), pi) NM HQ. yO. pi n. D) 


(21) 


is non-degenerate when evaluated at the optimal configuration (xf, рү). A simple 
calculation shows that 


2.1.1 2..1,1 2.1.1 
e? Ph iplo e” xpi o? G@ire * xi Pi 
M (xı, pi) = 
2,2,2 3-5 0 2,2,2 f? 
—e 9? "n ctxlplo?e 9 AVP —0? (x?) e77 aun 


implying that 
2. 
g (1 — log(K /2)) E 


М(хї, р?) = 
“ (чк а: 


and we can conclude that 
det M (x, р?) = 207 (log(K/2) — 1), 


which is zero if and only if K — 2e. We summarize the results of this calculation as 
follows: 


e In the generic case К Z 2e, the non-focality condition of Theorem 1 holds true, 
and we obtain (from Corollary 3) the following (short time) density expansion of 
Br = exp(o №1) + ехр(о We), expansion 


LE) 
T | VT 


When specialized to unit volatility, we recover precisely (1)a. 

e For K = 2e, the initial stock price is focal for the minimizing configuration, so 
the non-focality condition of Theorem | fails. And indeed, we want it to fail for 
the actual expansion in this case, namely (1)b, is not at all of the generic form 
predicted by our theorem. 


Kt Z (co + O (T)) 


Remark 4 It is immediate to use this analysis to deal also with the case of non-unit 
(but identical) spots 5 = 50 by scaling the Black-Scholes dynamics accordingly, i.e., 


by replacing K with К / Ši. Hence, in this case focality happens when log (&) = 1, 
0 


i.e., when К = 25де. 
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Remark 5 The question arises if the critical ("focal") case К = 2e, with atypical 
algebraic factor T 7/4 cf. (1)b, can also be recovered by a general theorem. Related 
results in [14] and also [17] suggests that this may indeed be the case but would 
require substantial additional work. 


5 Extensions: Correlation, Local and Stochastic Vol 


5.1 Analysis of the Black-Scholes Basket, Small Noise 


In Sect. 4 we analyzed the density of a simple Black-Scholes basket with dynamics 
dB, = S}odW;  S2eaW?. 


As explained in Sect.3 the analysis is really based on a small noise (small vol) 
expansion of 
dB = S} oed W} + S? oedW?, 


run til time T = 1. Consider now a situation with small rates, also of order e. In 
other words, | | | 
d$S,* = rS; edt + Sj ^oedW!, 


and then Bf = Ss + AE as before. We still assume 55 = 1. Alook at Theorem 1 
(now we cannot use Corollary 3) reveals that the entire leading order computation 
remains unchanged (at least at unit time and with trivial changes otherwise). The 
resulting (now: small noise) density expansion of Ву |71 is more involved and 
takes the form 


A (K) 2r log(K /2)\ 1 
KR exp (- a ) ex (EE) p (co + О(є)) . (22) 


Here A (К) is given in closed form, cf. (20), so that A’ (K) = DUIS is also 
explicitly known. Furthermore, under similar restrictions on К as before, ho is (still) 


given by (19), so that 
ho _ ((K/2y 
Do (2); 
Thus, the ODE for X (see Theorem 1) is given by 


dE, _ Ң (K/D) 
e cts [Ds er (Go). Xo = Xo = 0, 
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xi у 
Xi; =ғ{1- {= ; 
2 log2 
implying that Ў = Х | +x 1 = rK/log(2). Thus, the second exponential term has 
the form given above. 


which has the solution 


5.2 Basket Analysis Under Local, Stochastic Vol etc. 


One can immediately write down the Hamiltonian associated to, say two, or d > 2 
assets, each of which is governed by local vol dynamics or stochastic vol, based 
on additional factors. In general, however, one will be stuck with the analysis of 
the resulting boundary value problem for the Hamiltonian ODEs; numerical (e.g. 
shooting) methods will have to be used. In some models, including the Stein-Stein 
model, we believe (due to the analysis carried out in [7]) that, in special cases, closed 
form answers are possible but we will not pursue this here. Instead, we continue with 
а few more computation in the Black-Scholes case for d assets. 


5.3 Multi-variate Black-Scholes Models 


In the multi-variate case d — 2 of a general, d-dimensional Black Scholes model 
with correlation matrix (о;у), the Hamiltonian has the form 


d 
Hx, p) = 2 > puo px a4 x! p’. 


ij=l 


Thus, the Hamiltonian ODEs have the form 
meh! Y math i=l,...,d 
# = cel Small L1... 


Rd e itis again easy to see that 2 ar ‘(t) p (t) = 0, implying that x! (t) p'(t) = 
m Po: The Hamiltonian flow has the fait 


464 C. Bayer et al. 


d 
(sevo [o (Ei оно! ht), 


(23) 
(v) exp [=< (xi рпа! pisi) t )r 


A; —0(X0, po) = 
1=1 


Using again that р! (t)x!(t) = р! (0)х! (0) for any l, we obtain the inverse Hamiltonian 


flow Р 
(xtexp[-o! (SL ue. Ta 


Host (x1, Pr) = d (24) 
(неви (amotis) 
The boundary conditions—at Т = 1—аге now given by 
X0 — So (25a) 
d 
> х'@) =К (25b) 
p) = pa) =- pa. (25с) 


Indeed, the transversality condition (25с) says that the final momentum p(1) is 


orthogonal to the surface | же, y =K |в whose tangent space is spanned by the 


collection of vectors еј — еу, / = 2,...,d, with еј, . . . , eg the standard basis of R4, 
The equations (25) are certainly not difficult to solve numerically, but an explicit 
solution is not available, neither in the general case nor in the case of d uncorrelated 
assets. 


Remark 6 The main point of this calculation is that while explicit solutions are no 
longer possible in a general Black-Scholes model, the phenomenon (1) potentially 
appears in all Black-Scholes models. Moreover, we stress that the non-focality con- 
ditions are easily checked numerically. 


Remark 7 Note that the discretely monitored Asian option can be considered as 
a special case of a basket option on correlated assets. Indeed, let us consider an 
option on 


1 N 
P. У Su, with (for simplicity) t; = iAt, i=1,...,N. 
i=l 


For each individual i € ( 1, ..., № } we have, for fixed At > 0, the equality in law 


= о Віл зо At _ c! Wi .—1(0)?At 
$ = Soe = Soe 
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for o! := Vio and Wi, == Вл Ali. In law, the vector (Wi, wee WA) corre- 
sponds to the marginal distribution of an N-dimensional Brownian motion at time 
At with correlation pj; — min(i j) ] < i, j < N. Thus, the Asian option corre- 


vij 
sponds to an option on the basket with 50 = So, с' as above and a correlation matrix 
pij With maturity Ar. Moreover, the asymptotic expansion of the price of the Asian 
option as Ат — 0 corresponds to the short-time asymptotics of the basket. 


Remark 8 А small-noise asymptotic expansion of the continuous Asian option on 
i. S;dt is also possible by the techniques of Sect.3 (with ellipticity conditions 
replaced by weak Hérmander conditions). Essentially, this is equivalent to letting 
N — оо in Remark 7—but more direct. 


As in the two-dimensional case, the boundary conditions can be solved explicitly 


in the fully symmetric case, when о! =o апа, вау, S$ = |. For suitable K the 


optimal configuration is 


х = (1,...,0)7, xf =(K/d,...,K/a)? 


log(K /d) log(K /d)\7 d d T 
рў = ( NOR ) | pp E. log(K /d), ..., SE lost /d)) . 


c? c? 
Introducing 
1 €2 c + €d 
1 —є2 
q-— €i > = : , 
1 —єа 


we obtain (for the case of d uncorrelated assets) 
М(х1, pi) := до, |0 zHocai(xi +3, pı +q) 
| fab 
ас}? 


where а = (ар,...,ад)! € ЩФ-ЭХ!, p = p(1,...,1) e RIX), G = 
diag(go,..., ва) € IRV-Dx(4-D with 


LL 
ay = — (o y 01)2е 6 rri l= 1,...d, 
ba [i= etapi], 


а= |16) е6", 1=2,...@. 
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In the symmetric case, we can evaluate M at the optimal configuration and obtain 


-c?K [1—1og(K/4)] $ --- [1—log(K/d)] 4 
—о?* = (1 — log(K/d)] 3s 0 

M(x}, рү) = rl | Ls | ; 
—o7& 0 + —[1 —log(K/d)] & 


whose determinant can be seen to be 
а 14! 
det М(х, p) = (—1)%с?К [o — log(K /d)) Z| А 


Thus, the non-focality condition fails if and only if К = de. Moreover, we obtain 


the ener: 
d log(K /d)* 


AGO = Hi. p) = 5— 3 


6 A Geometric Approach to Focality 


In this final section we take a more geometrical look at the non-focality condition 
appearing in Sect. 3.2. Consider the Black Scholes model 


45] = a SjaW;, law, aw!) = pi, jdt. 
t 


We change parameters S — y — x, by 


«(d 
i = So i „эй i=l d 


where p denotes the correlation matrix of W and p = LL’ its Cholesky factoriza- 
tion. Obviously, Ss = Sher "In terms of the x-coordinates we have 


xt = x (Е) = Lip log (8/52) /o?, 
Si = si (x) = Shei LPP. 


The advantage of using the chart x is that the corresponding Riemannian metric 
tensor is the usual Euclidean metric tensor. Thus, we simply have 


d (So, S) = |xo — x| 
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and the geodesics are straight lines as seen from the x-chart. Note furthermore that 
S = So is transformed to x = 0. | 

The payoff function of the option is given by (> wi Sp — К у. We normalize 
w; = l апа T = 1. The strike surface F = [s € RZ |2, Si = K}, which is 
(a sub-set of) a hyperplane in S coordinates is, however, transformed to a much more 


complicated submanifold in x coordinates. Re-phrasing the equation >", S5 = Кіп 
y-coordinates and solving for y gives 


d—1 

Ч ї а ipyp 

yf = log (« — > se” M MUI Jot, 
i-l 


with (LU ) = (1; yS which implies—using that L and L! are lower-triangular 
matrices — 


d-1 О | d—l 
144х4 = log (« == >, Sie 22 1х! ) is je = >, peus. 
і=1 k=1 
For sake of clarity, let us introduce the notation q = (q!, eed q! =!) = (xl, xdg 


х). A parametrization of the strike surface F is then given by the тарф: С 
R! — R4 with 


U :— L eRe! 


d—1 
А ivi ip4p 
> Sje” 22-114 Zl 


i=l 
and 
1 d-1 "M 4-1 
. É 1 їр p А 
9(q) := (a та L (« — У Spe Zrt” ys! Jo? — Zieh). 
i=l k=1 


Note that by the change of coordinates, we are implicitly assuming that S^ > 0 for 
all i. Moreover, the standard basis е (p), ..., ел—1(р) of the tangent space Tp F to F 
at p = o (q) is given by the columns of the Jacobi matrix of o evaluated at q, more 
precisely we have 


=, ; 28 j j j Га 
jaa 1 | 1 Efa oiL sgen Zr ba 
ег(р) = | 6; j=l? тай 


[ 4! 
d d—1l oi oi VJ ЯША 
с K = S; > T 
x 2. 0° rel 
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fori = 1,...,d — 1 and p = g(q). Consequently, the normal vector field N to S at 
р = ф(9) is given by 


1 [а Eöl oiris Eia 0 Is 
N(p) = HL” ‚= м ; 
рор | | za | ьа к — Yd sien Za pe idi 


і=1 


where о is a normalization factor guaranteeing that |N (р) | = 1, i.e., 


= ; ЧН j 7 Ј К? 
1 рэга : c! LI Sle с? $7 Lq 
gd K= xw | sje oi yl Liq r 


pq 


d—1 1 
«(р) = |1 pa тау 


The Weingarten map or shape operator Ly : ТЕ — TyF is defined by 
Lp (d9,-19(v)) = —d(N o q)(e ^ (p) - v. 


v e R! = Tp-1 U> see [8]. In other words, for g(q) = p, we interpret N as 
a map in q and —Ly is the directional derivative of that map. We study the Wein- 
garten map since it gives us the curvature of the surface F. Indeed, the eigenvalues 
K1(p). ..., Ка—1(р) of the linear map Lp : ТЕ — ТЕ are called principal curva- 
tures of F. Then the focal points of F at p are given by 


(p + l меру <i <d-—1 such that k;(p) 4 0}. 
ki (p) 


In order to compute the eigenvalues of the shape operator, we need to compute 
the representation of Lp in the standard basis (e; (р), ..., ea.1(p)). Let us denote 
this matrix by L(p), then we obviously have 


_ a 
L(o(q)ij = Б оф)(9), е: (ф(9))), 47 = 1,...,4- 1. 


The principal curvatures kı (р), ..., kg 1(р) are, thus, the eigenvalues of the (d — 1)- 
dimensional matrix L(p). 

Since the calculations become too complicated in the general case, we now again 
concentrate on the case of two uncorrelated assets, i.e., d = 2 and p = L = h. In 


this case, we have 
E ше 
е = { 1, Р 
10р ок 51027141 


: = (e!sje tio (к - se). 


N(o(q)) = 


Vo? Gp3e e! + (02)? (K — Sheo'a") 
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Thus, the Weingarten map is given by 
Lp(vei(p)) = v«(p)ei (р), 


where for q = (q!) € R 


K (o y (o?? sje" (51е7'9 — K) 
к(ф(9)) = К1(ф(д)) = 


3/2 
| e o csptee t + (02)? (Shera! — К) 


is the curvature of the curve F in R2. We see that « = 0 if and only if K — 80 e? 4! ; 
i.e., at the boundary of the surface F. Otherwise, к is negative. 

Here, both components of N (p) are positive on F. Consequently, for any p — 
ф(9) € S there is precisely one focal point f = f(p) є R2, which is given by 


sie" * [2(0)°K — (e? + 55s) | - (о?у?К? 
cl(c? K (К — 51е9'4!) 


Р=4' + 


, 


І К — Sie?’ o? Kei (01) + (02)2) She?" 
2 _ 0 | 0 
Ё = = log 2 132 1 22 

о? 52 CP (esl (01202 


Denoting р = (x!, x?) and re-introducing the short-cut notation 5? = Sjer, 
i = 1, 2, (noting that S! + 52 = К) we can express f as 


S! [2(0?? K = ((о1)2 + (о2)2)51] JEN (07)? K? 


iig] 
f =x + ol(o2)2K S2 > 
i 9, 8 [ZOK — ((о!)? Fes | — (0° K? 
асдаг (01920251 


In the current setting, let q* be the optimal configuration in q-coordinates, i.e., 
the point on F with smallest Euclidean norm. Then the non-focality condition of 
Theorem | is satisfied, if O is not a focal point to y(q*), see the discussion in the 
proof of [6, Prop. 6]. 


Remark 9 As both components of the normal vector N are non-negative on F and the 
curvature к is negative, 0 can only be a focal point if F has a non-empty intersection 
with the positive quadrant. Inserting into the parametrization of F, we see that this 
can only bethe caseif K — SÈ F S In other words: if the option is in the money, then 
the non-focality condition is always satisfied (in the two-dimensional, uncorrelated 
case). 
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Let us again use the parameters of Sect. 2, i.e., Si = 5 = 1,0! =o? = o. Then 
we consider S* = (K/2, K/2), which translates into х* — (се, eek ot), 
Inserting into the formulas for the focal points, we obtain 


log (£) – 1 
Ha“) = Р(х?) = g(z) . 
o 
So, O is focal to the optimal configuration, if and only if 


K — 2e, 


and we recover, once more, the results of Sects. 2 and 4— recall that So corresponds 
to 0 in x-coordinates. 

In Figs.2 and 3 the focal points are visualized for two different configurations 
of two uncorrelated baskets. We plot the surface F as a submanifold of R2. We 
have seen above that for any p € F there is precisely one focal point f(p). Hence, 
we additionally plot the surface {f(p)|p є /}—more precisely, part of this surface. 
In Fig.2 we show the case constructed above where the non-focality condition is 
violated. In Fig.3 the option is ITM. As explained above, in the ITM case the manifold 
F does not intersect the positive quadrant, implying that the non-focality condition 
is satisfied. 


(a) (b) 


©ч =] N — 


о о 
i E А 
— Е PO Е 
- Opt. path i |---- Focal points 
А —8— Opt. config. 
a _| a _ 
D | А 
T T | T T T T | | T 
-2 -1 0 1 2 -2 -1 0 1 2 
Optimal configuration Focal points 
Fig. 2 Optimal configuration and focal points for two independent assets with с! = o? = 1, 


So = (1, 1), К = 2e. a The dashed line depicts the optimal path between the spot price So (0 in the 
q-chart) and the optimal configuration. b Dotted lines connect some selected points on the manifold 
F with the corresponding focal points. Points marked with a triangle visualize the construction of 
the focal points. We see that 0 is, indeed, focal to the optimal configuration 
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o e 

Ѕ 7 Qa E 

0 Te) 

© У c a 

T T 

o o 

T T 

о о 

MEN т) 

о о 

ai |] si “| Кш 

0 em O t. path Б Lj Е 
| ~ Opt. ра a 2d amm 

9 E Е - Focal points 

Ө Ө —— Opt. config. 

© 4 gJ а 

DE T T T T | 9 | | | | | 
-3.0 -2.5 -2.0 -1.5 -1.0 -0.5 0.0 -3.0 -2.5 -2.0 -15 -1.0 -0.5 00 

Optimal configuration (in the money regime) Focal points 


Fig. 3 Optimal configuration and focal points for two independent assets with o! = о? = 1, 
So = (1,1), К = 2/e. a The dashed line depicts the optimal path between the spot price So 
(0 in the q-chart) and the optimal configuration. b Dotted lines connect some selected points on 
the manifold F with the corresponding focal points. Points marked with a triangle visualize the 
construction of the focal points. This example illustrates the fact that the non-focality condition 
always holds when the basket option is in the money 
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On Small-Noise Equations with Degenerate 
Limiting System Arising from Volatility 
Models 


Giovanni Conforti, Stefano De Marco and Jean-Dominique Deuschel 


Abstract The one-dimensional SDE with non Lipschitz diffusion coefficient 
dX, = b(Xjdt--oX]dB,, Ху=х, y «1 (1) 


is widely studied in mathematical finance. Several works have proposed asymptotic 
analysis of densities and implied volatilities in models involving instances of (1), 
based on a careful implementation of saddle-point methods and (essentially) the 
explicit knowledge of Fourier transforms. Recent research on tail asymptotics for 
heat kernels (Deuschel et al. Comm. in Pure and Applied Math., 67(1):40—82, 2014, 
[11]) suggests to work with the rescaled variable X^ :— &!/(—»? X: while allowing 
to turn a space asymptotic problem into a small-e problem, the process X* satisfies 
a SDE in Wentzell—Freidlin form (i.e. with driving noise edB). We prove a pathwise 
large deviation principle for the process X^ as є — 0. As it will be seen, the limiting 
ODE governing the large deviations admits infinitely many solutions, a non-standard 
situation in the Wentzell—Freidlin theory. As for applications, the ¢-scaling allows 
to derive leading order asymptotics for path functionals: while on the one hand the 
resulting formulae are confirmed by the CIR-CEV benchmarks, on the other hand 
the large deviation approach (1) applies to equations with a more general drift term 
and (ii) potentially opens the way to heat kernel analysis for higher-dimensional 
diffusions involving (1) as a component. 


Keywords Pathwise large deviations * Square-root diffusions * Tail asymptotics - 
Freidlin-Wentzell - Large deviations * Degenerate diffusions + CIR process 
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1 Introduction 


The Wentzell-Freidlin large deviation theory studies the asymptotic behavior of 
the distribution on path space of the solution to the equation dX; = b(X*)dt + 
ec (X; )dB,, X5 = x as є — 0, where B is a Brownian motion. When the coeffi- 
cients b and ø are, say, Lipschitz functions, it is easy to see (with an application of 
Gronwall's Lemma) that the trajectories of X^ converge in law to the deterministic 
solution of the ordinary differential equation dy; = b(y;)dt, ро = x. The theory of 
large deviations accounts for the rate of this convergence: denoting W the Wiener 
measure, the large deviation principle (LDP) 


W(X €T) ge 2 Moer 1) 


holds for subsets Г of the path space С([0, T ]).! Denote (Л) the unique solution of 
the ODE dy; = Б(р;)а + o(qi)dh;, ро = x, where the control h is an absolutely 
continuous path with square integrable derivative h. The rate function J is given by 
1(ф) = Ales where Л is the control steering the trajectory of the deterministic 
system along the given path à, that is (Л) = ф. When the diffusion coefficient с 
is invertible, the control л is identified by h, = alo)! (pr — b(vi)), yielding the 
typical form of the rate function 


_1 fT (à - bó? 
rosz) 07а, 


The intuition behind such a result is that we can write X€ (w) = X (ew), where X 
is the ‘pathwise’ solution of dX = b(X)dt + c(X)dB, Xo = x. If we accept that such 
a map X exists and is regular enough, then the contraction principle in conjunction 
with Schilder's theorem for large deviations of Brownian paths [12, Chap. 1] provides 
the LDP and the rate function for X^. The standard assumptions under which such a 
program is carried are conditions of global Lipschitz continuity and ellipticity for the 
coefficients, see [10, 12]. Several works have aimed at weakening these assumptions 
and extending the class of equations for which the LDP holds. Dependence on e in 
both the drift and the starting point can be introduced, and global Lipschitz continuity 
can be replaced with (essentially) local Lipschitz-continuity and conditions for the 
non explosion of the solution (building on the idea of Azencott [3] to exploit the 
quasi-continuity property of the Itó map, that only relies on local properties of the 
equation coefficients). We refer to [4] for a nice recent summary of sets of conditions 
under which the Wentzell—Freidlin estimate holds. 


'The precise statement here is — inf & 1(д) < liminfz,oc?logW(X* € T) < lim sup. e 
logW(X* eT) < — inf eF I (Q). 
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Recent research on heat kernel asymptotics [11] focuses on the tail behavior for 
correlated stochastic volatility models. Exploiting the space-scaling properties of 
the log-price process Y; in some parametric models (namely: there exists 0 > 0 
such that the rescaled variable Yf :— c? Y, has the same law as the log-price in a 
stochastic volatility model with driving noise edB;), the approach of [11] is to convert 
the asymptotic problem for the tail distribution, W(Y, > R) as R — oo, to the 
problem of small-noise probabilities, W (YF > 1) as = — 0. Then, a large deviation 
principle for the rescaled process serves as a building block to study the asymptotic 
behavior of the corresponding heat kernel (using the tools of Malliavin calculs and 
the Laplace method on path space, see [5, 7]). This approach can be fully justified, 
and explicit computations are possible, for the stochastic volatility model of Stein 
and Stein [25] (also known as Schóbel-Zhu [24] in the correlated case), where the 
stochastic volatility follows an Ornstein-Uhlenbeck process with constant diffusion 
coefficient, which is the main case-study of [11]. As pointed out in [11, Sect. 5.3], in 
the framework of models where the volatility has square-root diffusion coefficient 
(main example: Heston), or more generally a diffusion coefficient of the form x^, 
ү < 1 (as in [2, 21]), such a space-scaling approach leads to a situation where the 
same approach is not justified anymore (and a formal application of the resulting 
expansion even leads to a wrong conclusion). Quoting [11, Sect. 5.3], “curiously 
then even a large deviation principle for (the rescaled volatility process) as given 
above presently lacks justification". 

To be more specific, consider the equation dX; = (o + 8 X;)dt + c X, dB, with 
positive initial condition Хо = x > 0. Looking for a value of 6 such that c^ X satisfies 
an equation with small-noise = leads to define the rescaled process X^ :— c'/(—? x, 
which indeed satisfies the equation 


dX; = (o£ + BXẸ$)dt + co (X$) dB, X = x* (1.1) 


with 
of :2gl/ü7 0e xf iz gI/ 7 y, 


Of course, this change of variables allows to write W(X; > А) = W(X; > 1) 
using c = R-I/(1—?, As mentioned above, the question is whether a large deviation 
principle holds at all for W(X; € -) as = — 0. Note that both the initial condition 
ху and the constant term o£ in the drift coefficient tend to zero as = — 0. On the 
one hand, it is not difficult to see that X^ — 0 in law with respect to the uniform 
topology on C([0, Т]). On the other hand, writing down formally the limiting ODE 
that should govern the large deviations, one gets 


ф = bpr + olp hs, фо = 0. (1.2) 
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The Eq. (1.2) is known to admit infinitely many solutions. When h; > 0, the set 


of solutions contains the one-parameter family oO = ей! (ca m Jo Rd 


hsds) / С? 1459), with 0 > 0.2 Then, the definition itself of the map h к> y(h) 
associating the control with the corresponding solution of the ODE is not anymore 
possible. 

We will occasionally address this situation as "degenerate". Let us note straight 
away that large deviations for diffusions with non-Lipschitz coefficients have been 
studied in Baldi and Caramellino [4] Donati-Martin et al. [13], Klebaner and Lipster 
[19] and Robertson [23]. In [4, Theorem 1.2] a large deviation principle is derived 
for the family of equations dX; = b(X7)dt + eo(X;)dB;, Хо = x > 0 (note the 
strictly positive initial condition), where the function c (-) roughly behaves like сх” 
(see [4, Assumption (A1.1)] for precise conditions) and b : [0, со) њ» R is a locally 
Lipschitz function with sub-linear growth and b(0) > 0. The conditions for both a 
drift term b and an initial datum independent of =, such that b(0) > 0 and x > 0, are 
violated in the situation we consider here. In [13], b(0) = 0 and x = 0 are allowed, 
but the analysis is limited to the square-root case y = 1/2, and b and x remain 
independent of =. Note in this respect that setting b(0) = x = 0 implies X^ = 0 
for all £, and in this case a LDP trivially holds with the rate function 7(0) = 0, 
I(¢) = oo for ф Æ 0 (as stated in [13, Theorem 1.3]); in contrast with (1.1), where 
both 5*(0) = a£ and x* do tend to zero as = — 0, but coming from strictly positive 
values, so that the solution of the SDE is non trivial for every value of =. In both 
these works, uniqueness for the limiting ODE is a key point (and appears as a part 
of [4, Assumption (A2.3)] and is exploited in [13, Sect. 5]). In order to study the 
asymptotic behavior of the ruin probability W (To < T) with то = inf{t : X; = 0} 
as the initial condition x tends to infinity, Klebaner and Lipster [19] exploit a similar 
space scaling by working with the ‘normed’ process X? = X;/x, and show that a 
LDP holds for the process X* as x — oo. The major difference with our setting is 
that the initial condition Хо = 1 in [19] is fixed and does not tend to zero as x* in 
(1.1), which is one of the difficulties to encompass in our analysis. Robertson [23] 
derives LDP for a class of stochastic volatility models, including the Heston model 
with square-root volatility process. One of the assumptions used there is that the 
small noise problem for the volatility process has the same form as in Donati-Martin 
et al. [13], see [23, Assumption 2.1], and the work carried out is to transfer the LDP 
to the second component of the process (the log-price). Therefore, the work of [23] 
does not cover small-noise problems in the form of (1.1). 

We establish a LDP for a generalized version of Eq. (1.1), allowing a to be a 
function of the process. That is, we start from Eq. (1) under the assumptions: 


(Al) y €[1/2,1),0 > 0, x > 0. 
(H2) b(y) = a(y) + бу, where a is a Lipschitz continuous and bounded function, 
and a(y) > 0 in a neighbourhood of 0. 


?When б = 0, y = 1/2 andh = 1, one retrieves the textbook example of ODE for which uniqueness 
PA ; E : "a 

fails, o; = су |; |, whose solutions from фо = 0 are given by the one-parameter family o; = 

c? 2 

"S — 0) ly. 


On Small-Noise Equations with Degenerate Limiting System ... 477 


Under (H1)-(H2), (1) is known to admit a positive solution, which is pathwise unique 
by Yamada and Watanabe’s uniqueness theorem. 


Theorem 1.1 Assume conditions (H1)-(H2), and let (X1)1>0 be the unique strong 
solution to (1). Set X€ := eV/0—? X: then X* satisfies (1.1) with the constant a 
replaced by the function о(-). Then, the family {X°}-~ satisfies a large deviation 
principle on the path space С([0, T], Ё) with inverse speed £? and rate function 


1 2 Ф, — Ве: : 
Ir = т | ( Е 1o; z0)dt, 


and Іт (qp) = +оо whenever (О) 4 0 or is not absolutely continuous. 


Let us note that in the definition of /r above, the expression 25 1,0 is intended 
to be well defined for any y; € К, and it is equal to zero when v; = 0. 
It is easy to see that the unique zero of Ir is p = О, consistently with the 
fact that X€ 5 0 as = — 0. Roughly speaking, Theorem 1.1 allows to write 
W(X* € Г) = exp (-2 (inter 1т(ф) + Ф©)) for subsets Г of C(O, T) such 
that inf cr IT (Ф) = inf & Ir (ф), where the function /(£) vanishes as = > 0; we 
refer to Theorem 2.1 in Sect. 2 for the precise statements. 

According to our definition of X^, one has W(X; > 0, Vt > 0,Ve > 0) = 
W(X, > 0, У > 0) = 1. A criterium for the strict positivity of the trajectories of 
X*, based on Feller's test for explosion, can also be given (see [9, Proposition 3.1]: 
when y > 1/2, a(0) > 0 implies W(X; > 0,7 > 0) = 1, while for y = 1/2, the 
same conclusion is guaranteed by 20(у)/02 > 1 for y in a right neighborhood of 
zero—yielding the familiar Feller condition 20/02 > 1 when a is constant). Note 
that Theorem 1.1 does not assume any of these conditions for the non-attainability 
of zero; in particular for the CIR diffusion, we do not assume the Feller condition on 
the coefficients o and c. 

From Theorem 1.1, tail asymptotics for some functionals of the process X can 
be derived (which is exactly why the e-scaling leading to X^ was introduced!). The 
pathwise LDP allows to consider path functionals of the process, such as the running 
supremum, or the time average. 


Theorem 1.2 Let (Х,);>о be the unique strong solution to (1) under conditions 
(H1)-(H2), and let T > 0. Then, as R — со 


W (Xr > R) = eK P rto) (1.3) 
and 
w( sup X; > к) = е RD (г+0(1)) (1.4) 
telo, T] 
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bg — 820-9) (vp +0(1)) 
W(z[ XdzR)-e | (1.5) 
0 


The constant cT, resp. vr are explicitly known in terms of the model parameters, and 
are provided below in Proposition 2.5, resp. Proposition 3.14 for the case ^y = 1/2. 


The estimates in Theorem 1.2 can be compared with the explicit formulae available 
for cumulative distributions and critical exponents in the CIR and CEV models: 
these consistency checks are done in Sects.2.1 and 3.4, showing that the estimates 
in Theorem 1.2 are correct on the log-scale. While in the one-dimensional setting 
the large deviation approach yield by Theorem 1.1 applies to equations with a more 
general drift term than a purely affine function, it also opens the way to heat kernel 
analysis for higher-dimensional diffusions involving (1) as a component, which is 
exactly the case left open in [11]. 

Let us finally note that, due to the non uniqueness of solutions for the limiting 
system, the problem we consider here appears to be related to the issue of regulariza- 
tion by noise of ODEs. Leaving further discussions to future work, let us just point 
out here a structural difference with that setting: in that context, one considers an 
SDE of the form dX; = b(X;)dt + edB;, with unit dispersion coefficient, seen as а 
perturbation of the deterministic system x; = b(x;) with non-Lipschitz drift b (e.g. 
b(x) = sign(x)|x|"). Among the possible solutions of the deterministic system, one 
then looks at the (few) ones supporting the limiting law of X*, obtaining the so-called 
Zero noise limits of the equation; see [27] and references therein. In our framework, 
the equation for X* already possesses a Lipschitz continuous drift b(x) = аг + Gx. 
Correspondingly, the limiting system x; = бх, xo = 0, already has a unique solu- 
tion (here: the null path x = 0), which then gives the unique weak limit for X* (in 
contrast to [27, Corollary 1.2], where the limit is a probability distribution supported 
on two trajectories). As we pointed out, the difficulties in our setting come from the 
non-Lipschitz diffusion coefficient and appear at the level of the definition of the rate 
function via the control system (1.2). 

In the remainder of the document, Sect. 2 is devoted to the proof of Theorem 1.1, 
while in Sect. 3.4 we prove the different statements of Theorem 1.2. We collect in 
Appendix A the proofs of some of the more technical material. 


2 Main Theoretical Estimates 


Let Q :— C ([0, T], К), Фо :— C ([0, T], R+) denote the space of continuous 
(resp. continuous non negative) functions on [0, T]. (О, F;, F) denotes the canonical 
Wiener space, W the Wiener measure on (QQ, F;, F), and E the expectation under W. 
We denote Н = {h є АС([0, Т], R): he L?) the space of absolutely continuous 
paths on [0, Т] with square-integrable derivative (usually referred to as Cameron- 
Martin space). For a set of coefficients a(-), D, y, с satisfying conditions (H1)-(H2), 
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we denote X the W almost-surely unique strong solution of (1). We define the rescaled 


process X^ := gra X; it is clear that X^ solves Eq. (1.1) with coefficients identified 
by a£ (x) = e/V a(x) and х = eV» x, Denote b*(x) := af (x) + x. 

The following theorem gives the precise PEE announced in Theorem 1.1 in the 
Introduction. We recall that the expression ~y l 1,40 is well defined for any y € К+, 
and it is equal to zero when y = 0. 


Theorem 2.1 Let X* be the unique strong solution to (1.1). Then, 


lim sup =? log У(Х € Е) < =inf Ir (p) 


c0 (2.1) 
lim inf e? log М(Х € С) > -inf Ir (p) 


for every closed set F C 90 and every open set G С 90, where the rate function 
Іт (©) is defined by 


1 T Qr — Ве; ? 
Іт (p) := 272 n ( Цо, 0}, (2.2) 


and Іт (qp) = +00 whenever (0) £ 0 ог i is not absolutely continuous. 


Remark 2.2 We could state the large deviation principle of Theorem 2.1 on Q = 
C ([0, T], R), setting the rate function Ir (p) to +00 whenever р @ >o. Since the 
process X* is known to be positive W-a.s. for every = > 0, with such a definition 
of the rate function the LDP (2.1) holds for every closed subset F and every open 
subset С of Q. 


Remark 2.3 As pointed out in the Introduction, the rate function for a family (X^) 
satisfying dX^ = b(X*)dt + £o (X^)dB,, Хо = x, can be written as 


= КУ 
Тт(ф) = шг] ils? :heH,o() = el (2.3) 


where (Л) is the solution to the limiting ODE controlled by л, o = b(y) + a(yyh 
and yo = x, provided this solution is unique. In our setting, consider р € S(u), 
where now S(u) denotes the set of positive solutions of the degenerate ODE (1.2) 
with control parameter h = и € H: on the set {y > 0}, и is uniquely determined by 
ọ via up = й ёли. on the set { = 0}, the function ¢ is seen to satisfy Eq. (1.2) for 
any control parameter л. This means that the set of л such that р є S(h) contains 


the infinitely many elements given by 


г Pt Бы Ве; dh, Е 
дра, —ly,-0, AEH. 
t vi {or >0} + dp = 


480 G. Conforti et al. 


The control / achieving the minimum norm is obtained setting һ = 0. This gives 
йә = inf(3IAl;? : h € H, є S(h)} = Ir(¢) for the rate function I7 defined 
in (2.2). 


Remark 2.4 Assume that b : [0, оо) — К is a locally Lipschitz function with 
sublinear growth and b(0) > 0, and that X^ satisfies dX, = b(X,)dt + ec (X,) dB; 
and Xo = x > 0. Then it is known from [4, Theorem 2.1] or [8, Theorem 4.2] that 
X satisfies a LDP with rate function 


1 [7 (p-b 
ro = za f (52) а, 


and Jr (y) = oo if ọ is not absolutely continuous, where one classically agrees that 
1/q; is equal to +оо if р; = 0. We stress that the latter rate function is radically 
different from /т defined in (2.2): whenever p = 0 on some non trivial interval 
К C [0, 1], then Jr (v) = co, while in such a case the integrand in (2.2) gives zero 
contribution to [т on К. In other words, while trajectories with a zero-set of positive 
measure require infinite energy to be followed by the process X in the small-noise 
limit, they are favoured by the rate function of the process X*. 


2.1 Tail Asymptotics 


The space-scaling X^ = &'/(1—? X together with the large deviation principle (2.1) 
allow to work out tail asymptotics for functionals of the process X. The follow- 
ing proposition provides the precise constants appearing in Theorem 1.2 in the 
Introduction. 


Proposition 2.5 The asymptotic formulas (1.3) and (1.4) in Theorem 1.2 hold with 
the constant cr given by 


Ве 2801-07 


ds ao? (1—y)(1—e-290—»T) if 8 Ф 0 (2.4) 


1 я = 
IT if B — 0. 


One can see that cr does not depend on the function a(-) in the drift of X, nor on 
the initial condition x. 


Remark 2.6 Some comments are in order. 


() Comparison with explicit formulae for the CEV process. The asymptotic 
behavior (1.3) can be compared with the explicit formulae available for the den- 
sity of the СЕУ process. When a = O in (1), X can be obtained as a deterministic 
time-change of a power of a squared Bessel process (see [16, Sect. 6.4.3]). As 
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a consequence, for every Т > 0 the random variable Хт is known to admit a 
density with respect to the Lebesgue measure on the positive real line, given by 


_ 0-2) 820-427 1 (20-3 4 201-2) .-280—-T 
Ўт) = ту е ат Pag me 
Е A 
x ху I T Tc (xz; !yl- Ye" P 9. у> 0, 
(2.5) 


where 7, is the modified Bessel function of the first kind of index v > 0, and 
2 r 
d(T) = apq — e 28 0—»T) (note en passant that one has d(T) > 0 for 


every choice of the sign of 8)? The formula (2.5) is also valid for 8 = 0, when 
one replaces all the -dependent constants with their limits as 9 — 0, such as 
d(T)|g-o = (1 — PT. Using the asymptotic behavior (see [1, Sect. 9.7.1]) 


_6_ 


м272 


of the modified Bessel function /,(z) ~ as z — oo for fixed v > 0, one 


immediately obtains 


е22801—7)Т 


20-3) 
2d(T) 


y = —cpy 47», 


log fxr (y) =: gO)  — x — oo, 


with the constant ст defined in (2.4). Using some standard tools of regular vari- 
ation [6], one can then easily prove that log W(Xr > y) = log S edz ~ 


g(y) ~ —cr y? 17 as у — оо, thus showing that estimate (1.3) is exact on the 

log-scale. 

(ii) The asymptotic estimate fy,(y) < Ат ecary 7m y > 1, for the density of 
Хт was proven in [9] for the solutions of a class of SDEs containing (1) under 
conditions (H1)-(H2) (namely, in [9] the coefficients 8 and y are also allowed 
to depend smoothly on X), relying on techniques of Malliavin calculus and 
transformations for 1-dimensional SDEs. The constant ат provided there is not 
optimal. While the estimates in [9] remain valid for more general equations, the 
large deviation principle in Theorem 2.1 allows to obtain a sharp estimate on 
the log-scale. 


The asymptotic behavior W(4 [s х,а) = exp(— 201—7) (ит + o(1))) for the 
time average of the process can also be proven using Theorem 2.1: see Proposition 
3.14 in Sect. 3.4, where an expression of the constant vr is provided in the case 


3When y € [1/2, 1), the law of X7 also possesses an atom at zero, Р(Хт = 0) = тт > 0, and an 
explicit formula for the mass тт is available (see again [16, Chap. 6]). From our point of view, this 
only means that the density fx, does not integrate to 1 on (0, oo), without affecting our analysis 
of the tail asymptotics at oo. 
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3 Proof of the Main Estimates 


We prove the large deviation principle in Theorem 2.1 by first showing the exponential 
tightness of the family {X*}-, namely for every m < 0 there exists a compact set 
Km C C([0, T]) such that lim sup,_, 9 c? log W(X* € Ky) < m. We then prove the 
weak upper bound 


lim sup lim supe? log У(Х є B(y, R)) < -Ir(g) Yọ € О>о, 
R—0 £0 


and the weak lower bound 


lim inf lim inf e? log W(X^ є B(o, R)) > -Ir(g) Yọ € 9-0 


R—0 = 


where В(‹, R) denotes the closed ball in С([0, T]) of radius А, B(y, К) := (9: 
|2 — Yloo < R}. It is a general fact that exponential tightness combined with the 
weak upper bound yields the large deviation upper bound in (2.1) for any closed set 
after a covering argument (see [12, Chaps. 1 and 2]). On the other hand, the weak 
lower bound trivially provides the full lower bound in (2.1), observing that open sets 
are neighborhoods of their points. 


3.1 Exponential Tightness 


We prove the exponential tightness considering balls in the Hólder norm ||w|,, :— 
SUD, ,—T sz NE and a natural bound on the initial condition шо. More precisely, 


we define 
Kpr := (lol < Ry {wo € (0, x]. (3.1) 


It is classical that these sets are compact in С([0, T |). 


Proposition 3.1 The family of measures W(X* € -) is exponentially tight in scale 
2: 
€^, Le. 


lim. limsupe?log W (X^ € K$) = —oo 


R>+% 2-0 


1 
for every 0 < т] < 5. 


We follow [13] in the proof of Proposition 3.1. First, let us observe that for = < 1, 
W(Xg € (0, x]) = 1 so that we just need to estimate the Hólder norm of X*. To this 
end, we use a version of Garsia-Rodemich-Rumsey's Lemma, and the existence of 
exponential moments for a process bounding X* from above. 
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Lemma 3.2 Consider (Уу, t > 0) the strong solution to 
dX, = (Aloo + IBI X))dt + o (dB, Хо = x 


and define XE := gV/0—» X. Then, there exist positive constants c and C such that: 


F (exp (ce? yia) «C, Wte[0,T], Ve» 0. (3.2) 


Proof According to the definition of X*, one has gue = AU so that 


(3.2) holds if and only if E [exp (cx? »j < C forall t € [0, T]. When y = 1/2, 
(3.2) follows from the asymptotic behavior of the density of the CIR process for 
large arguments (see e.g. [16, Sect. 6.3.2, p. 358]); for general y and 3 = 0, from the 
asymptotic behavior of the density of the classical CEV process as stated for example 
in [16, Lemma 6.4.3.1, p. 368]. For general у and д, we rely on a slight generalization 
of the proof of [9, Proposition 3.3]; we leave the details to Appendix A. 


The next proposition is a direct consequence of Garsia-Rodemich-Rumsey's 
Lemma; see Appendix A for a statement of this lemma and a proof of Proposition 3.3. 


Proposition 3.3 Let о € Q. Fixe, К > 0,7 є (0, 1) Assume that: 


Lf (S - aste к (R) (3.3) 
0 JO Р ENTE i Ed | 


with Ke »(R) := 1 ехр (T Т"- (E — 4Т12- " — K,)) — 1T? and Ky := 


SUD, єго, T] 2u'/2—" log(u-!) < оо. Then, 
wll) < А. (3.4) 


In the proof of Proposition 3.1, we exploit a localization procedure: for any = > 0 
and n € N, define the process X^" as the strong solution of the SDE with truncated 
coefficients: 


dX," = Б (ХӮ лп)а + oe (X;" ^n) dB, Ху" = х°. (3.5) 


The paths of X^" can be decomposed in their martingale part and locally bounded 
variation part 
dX;" = аА" Fc dM;" 


with 4M; " = &c(X;" л п)?ав, and dA," = b°(X;"" ^n)dt. We shall also define 
for every и, = the stopping time T^" :— inf lt > 0: Хғ > n]. By the pathwise 
uniqueness for Eq. (1) (equivalently, (3.5)), we have that up to time 7” the processes 
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(x$) 110.7] апа (Хх) 110.7] coincide almost surely. More precisely, Vn є IN and 
=> 0 
ИО о Yt € [0, T]) = 1. (3.6) 


Proof of Proposition 3.1 Let us fix 7 € (0, 1). By (3.6), 


W (IX*|; = R) < W (IX^"]; = R, T^" 2 T) + W (T*" <T) 


< W (IX®" ll} = А) +W (T^" < T). (3.7) 


Let us estimate the first term in (3.7). Using Proposition 3.3 and Markov's inequality 
we have for every =, n: 


en T of ME = Ms" 
МОМ = R) < W exp | € ^————— — | dsdt > Ken (R) 
0 JO 


|t — s| 
stig ff eo (LEEN) at 
^ Ken(R) Јо Jo A It — s 


Applying the exponential martingale inequality E (exp(AM;)) < (exp 222 (M)1)) 


[22, Chap. IV] with А = ЕЛЕЙ? for t > s one has 


Мг" =. z) ( ( 20? j є,п 2y )) 
г дех RM —————À 2 ex SS Хг ^n dr 
( P( e2./t—s m E e2(t—s) Jy (x: ) 


< 2ехр С , 


Therefore, using the definition of the constant K-,,(R) in Proposition 3.3 
R 
lim sup 22 log W (I M^" |; > R) < es a n (3.8) 
e>0 8 
For the bounded variation part A^", we observe that 


W (ГА > R) < „(тә sup bf (X^ An) > к). 
te[0,T] 


Under hypothesis (Н), р(х) < [aloo + бх for every x. Therefore, for every =, n 
W (IAM > R) < W(TI"aloo + Bn) > R) = 0. (3.9) 


where the last identity holds as soon as А > T!~"(laloo + Gn). 
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We now deal with the second term in (3.7). It follows from the comparison theorem 
for one-dimensional SDEs [17, Proposition 5.2.18], that X; < x t < T, almost 
surely, where X* is defined in Lemma 3.2. For every fixed y anda > 0, it is a simple 
exercise to show that the function y +> exp(ae~?(1 + y)?"-”), у > 0, is increasing 
and convex if £ is small enough.* For such values of e, since x ғ is a submartingale, so 


is exp (ae + ipia). Then, we can apply Markov's inequality and Doob's 


L?-inequality, obtaining: 


ү [ree < T) = w( sup X; > n) 


te[0,T] 
E ~ .\2(1-7) 
W| sup exp[ae (1 + x) 
te[0,T] 
> exp Са + =) ) 


< exp (-ae?Q 4-97?) x 4 E (exp (ae? + x17). 
(3.10) 


ІЛ 


Using the elementary inequality exp(a(1 + y)?7?) < ехр(а2201-7)) + exp 
(a (2у)?Ч—1)), and choosing a such that a x 220—7) = c where c is the constant in 
Lemma 3.2, it follows from this lemma and estimate (3.10) that 


Ww (re < T) < exp (-ac 59077) x4 [exp(ce™) Æ c| , (3.11) 


where C is the second constant in Lemma 3.2. Now choosing n :— IVR], the 
condition under which (3.9) holds true is satisfied for R large enough. Passing to the 
limit as = — 0 in (3.7) and using (3.8), (3.9) and (3.11), we obtain 


R 
lim sup 22 log (W (Х|, = R)) < max E + 0281, -aR + e] . 
є—>0 


Letting R — oo, the conclusion follows. L| 


3.2 Weak Upper Bound 


This section is devoted to the proof of the following proposition. 


4The second derivative reads e^* ^1 x 2ae-2(1 — y) c y)? x[1-2y + 2 — y) 
a +y]. 
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Proposition 3.4 V € 9-0 N Н: 


lim sup lim sup e log W (X* € В(, R)) < —Ir(q). (3.12) 
R—0 e>0 


For every л € Н, = > О апа Фф € Qs, define 


5 


Т Т 5 
F (дуу = hrór - подо —hr | Gods | (о. = | водат), 
0 0 0 
о? Т 
-Z| Ras (3.13) 
2 Jo ` 


By setting £ = 0 in (3.13), we can define the functional F? (Ф, h). Note that F* (-, h) 
is continuous УЛ € Н оп the whole space 20 with respect to the sup-norm topology, 
and converges to F° (., h) uniformly on $259 as e — 0. 


Remark 3.5 Applying the integration by parts formula to the product л, Xf, one has 


T T 
eo | hi (X5 dB, = пох -hoi | [XE + А,Ь CXEylat 
0 0 
= һт Хт. = hoxg 
T T К 1 
-ar f wapa- | a(x- f b*(X3)ds) dt, 
0 0 0 


hence 
T 2 [ГТ 
FQ, h) = so | hs(X5)aBs => a he (XS ds. 
0 0 
According to Remark 3.5, the random variable 
&,h А 1 - z 
Мт (w) := exp aF (Xw), А) (3.14) 


is the value at time T of the local exponential martingale associated to = Jo hs (X)? 
dB;. It should be stressed that, for any Л € Н and = > 0, the functionals F*(o, h) 
and M. М (ф) are well defined for every ф є >р, and not only almost surely. 


Proof of Proposition 3.4 Since any positive local martingale is a supermartingale, 
we have 


F[M£^] < 1. (3.15) 
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Fix now a trajectory р € £229. Using the remark above: 


1 peiye 
W(X* € В(у, R)) = E [e pue Эм cena 


l 
< зир exp (-5* (0, D E (uz) 


peB(y,R) 


1 
< sup exp (= (Ф, D Р 


феВ(ұ, К) 
Since Supe p(., к) |F* (o, h) — F? (ф, h) | > 0, we have that 


lim supe? log У(Х є B(y, К)) < sup (—F (ф, А)). 
є—>0 peB(y,R) 


Therefore, by the continuity of ф œ> F (ф, h), 


limsuplimsupz?log(W(X* є B(y, К) < —F°(y,h), Whe H. 


R—0 E> 


In the next proposition we prove that: 


sup F? (4, h) = Ir(y) 
heH 


which concludes the proof of (3.12). п 


Proposition 3.6 Уу € 90 we have that: 


sup F°(y, h) = Ir() (3.16) 
heH 


Proof Assume o € Q>09MH is such that Ir (у) < oo. Then, the function u defined by 


uo = 0, ús = ф5— m 15,20 is by definition an element of H, and ¢ satisfies by con- 


struction the ODÉ (1. 2) with control u. Repeating the computations in Remark 3.5, 
one can see that 


Т 2 T 
F°(y, h) = г | hsplusds — =f h2 ds. 
0 0 


Note that F°(y, h) is concave in Л, hence if it has a critical point, this must be a 
maximum. The Fréchet differential D^ F (4%, h) at h, applied to the generical element 
k € H,reads 


T 
D' F (o, Ш = Zi ks [yas = ehe | ds. 
0 
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Therefore, D^ F? (4, h)|n=n* = 0 at any h* such that h* = Hs on {5 : Ps Æ 0} 


09; 
(while h% can take any arbitrary value on {s : Ys = 0}). For such h*, one has 


"E GRO Е О at [Ты ш» Ap 
Ft RY = | Gs Lygods— 5 | Cis) Menos = 5 | ly, gods = Ir). 


On the other hand, if y is absolutely continuous and such that [т (p) = +оо, one can 


approximate the function £s -pes with a sequence h” є H such that F (i, h”) > 
Qs 


+оо. и 


3.3 Weak Lower Bound 


This section is devoted to the proof of 


Proposition 3.7 For all p € Qso, we have 


lim inf lim mi e? log W (x° € В(, R)) > —Ir(y). (3.17) 
— E> 


In the spirit of Lamperti's transformation, we introduce the process У :— (X°)!~7. 
Y* satisfies a SDE with constant diffusion coefficient and a drift coefficient that we 
will be able to control. We will prove a large deviation weak lower bound for Y*, 
and then transfer it to X^ by means of the contraction principle. 


Proposition 3.8 Define 


1 е 
x n (Фф — ва — yu)” dt 


Try) := лү? 


for wb € Qso, where Tr (p) = +оо if (0) Z 0 or w is not absolutely continuous. 
Then, for all » such that Lr (i) < +оо, one has 


lim inf lim inf e? log W (Y* є B(w, R)) > —Ir (4). (3.18) 
R—0 e—0 


In other words, the family Y^ satisfies a large deviation weak lower bound on 
C ([0, T], R+), with rate function Tr (4). 


Once we are provided with Proposition 3.8, it is straightforward to prove the weak 
lower bound for X*. 


Proof of Proposition 3.7 Consider i» € 90 absolutely continuous. By Lemma 
3.45 in [20],  — O a.s. on (v = 0}. Therefore, Zr defined in Proposition 3.8 


can be rewritten as Zr (v) = xm T. (or Bd Yi) Lu, zodt. Using the 


1 
definition of Y^ and (3.18), since the map у +> « = 41-7 is continuous on £259, 
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we can apply the contraction principle and obtain that W (X* € .) satisfies a large 
deviation weak lower bound with rate function /7. Let us describe Ir (у) when q is 
absolutely continuous and such that /7 (к) < оо (where Ir was defined in (2.2)). Let 


V, = er. On (o = 0], one has 0 = 0 as well, while for a point ¢ in the open set 
{о > 0} such that Ф; exists, one has v; = (1 — Wer Then, noting that /r (p) < оо 


implies that A 15, «9 is integrable on [0, T], v is also absolutely continuous on [0, Т] 


(see [20, Corollary 3.41]), with derivative ji =(1- 75 1,>0 This yields 


_ 1 
Іт (р) = 7т(ф()) = hA 


T . 
| (a 95. - BAe) Modo т «oo. (319) 


1 


If I (p) = oo, there is nothing to prove in (3.17), and the claim follows. 


3.3.1 Proof of Proposition 3.8 
This section is devoted to the proof of the large deviation weak lower bound for 
the process Y* in (3.18). While postponing some of the most technical elements to 


Appendix A, we will make use here of the following notation: foreveryh є Н, y є К, 
we define 5, (Л) to be the unique solution on [0, T] of the ODE 


iy = 80 = у) +o —3)u, фо = y. (3.20) 


We denote У" the measure on © associated to the Girsanov shift ыі | Ё tdt, 


awe" w) (: К dB, — T izat) (3.21) 
ш) = ex = ===. К Е 
dW р € ЈО ы 2c? 0 { 


An application of Girsanov's Theorem shows that W (62 € ) £L wen (Х €-), 
where X*^ solves: 


» ES 1 
ахе" = BE (XS" dt + o| XE" hdt + сс| Хав, Xo" = єх. (322) 


We also define the process У" := |Х®”|!—7, 


Remark 3.9 Note that for (3.22) there exists a weak solution, which we construct 
directly from a solution of (1.1) applying Girsanov's Theorem. Since pathwise 
uniqueness holds for the couple (b, с), another application of the same theorem 
shows that pathwise uniqueness for (1.1) implies pathwise uniqueness for (3.22). 
Therefore we can always assume that X *^ solves (3.22) with the Brownian motion В. 
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Two main ingredients enter in the proof of Proposition 3.8: the convergence in 
law (under some conditions on Л) of the process Y*^ to the deterministic limit 
So (A) under the measure W (equivalently: the weak convergence of the measure 
W*^ (Y€ є.) to dso(h)), and a lower bound for the probability W (У € B(v, R)) 
depending explicitly on the relative entropy between the two measures И" and W. 
This is the content of the two following lemmas. 


Lemma 3.10 (Convergence in law of Y5^) Let h € Н be such that 


(i) So(h); > 0, Vt є (0, Т]; (її) h, > Кіпа neighborhood of 0, for some К > О. 
(3.23) 


Then, the process yer converges in law to So(h) under W, as = — О. 


Lemma 3.11 (Relative entropy bound) Let (Q, F) be a probability space and P,Q 
two probability measures on (©, F) such that dQ = FdP. The relative entropy 
H(Q|P) is defined as: 


H(Q|P) := / F log(F)dP 
Q 


Then, VA € F we have: 


=] 
1б (202) > е X HT). (3.24) 


Q(A) Q(A) 


Proof Applying Jensen's inequality, one has 


log (292) > log (/ pado. ) 
Q(A) A Q(A) 


1 1 
— — — | log(F)d ——— | (log(F)F)*aP. 
> QU) m )dQ > Qu) OR )F) 


Using the elementary fact that inf «o x log(x) > — 1; 


01 еі + H(QIP) 
Q(A) QA) ^ 


which proves (3.24). п 


/ (log(F)F)*dP > — 
A 
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The relative entropy H ("| ИУ) is easily computed using the martingale property 
of Fer = exp(4 i hsdBs — XT Js h2ds) and It6 isometry: 


1 Í fT, 
„h — pr &,h 2 
H(W*^|w) — (ri (if Ma xt Ja izat)) 
i ei 1 1 das 
=Е{-/ Fr'hdB,x- бав, —-—]| hdt 
€ JO 222 0 
T. 
mys а 55 p fidi 


" D Т» 
H (w*^iw) = =a | hat. (3.25) 


therefore 


The proof of Lemma 3.10 is postponed to Appendix A; using this lemma and 
Lemma 3.11, we can achieve here the proof of Proposition 3.8, completing the proof 
of the large deviation weak lower bound for the process X*. 


Proof of Proposition 3.8 If Zr (v) = оо, (3.18) is trivially true. Then, consider 


i € Qso such that Zr (i) < оо, and define h є Н by setting h, = Lee so 
that So(h) = v. 

Step 1. Assume that h is such that (3.23) holds true. An application of the relative 
entropy bound (3.24) with Р = W, О = №" yields 


E (e! + H(W*^|w)) 
Wer (Ye є B(v, R)) 
+ e€? log W*"^(Y* є B(v, R)). 


c? log (W (Y* є BW, R))) > 


Using "(У є B(y, R) = W*(Y*^" є B(y, R)) — 1 for every R > 0 by 
Proposition 3.10, and the expression of H( wen |W) from (3.25), taking the limit as 
€ — 0 we obtain (3.18). 
Step 2. Assume now s» € C!({0, 1]). Leth be defined as above, and define ^" є Н, 
n € N, by : : 
В? := hy +1/n. (3.26) 


We claim that Ул є N, A" satisfies (3.23). Let us first prove that condition (11) in 
(3.23) holds. Observe that  — 0 and v9 = 0 imply % > 0, һепсе hn > 1/п. 
By the continuity of h^, ensured by the fact that 1» € CD. T]), it follow that the 
condition (ii) in (3.23) holds with, say, k = 1/(2n). In order to prove condition (7), we 
observe that the comparison principle for ODEs implies that Vt € (0, T], So(h”); > 
So (I); = v; > 0; condition (i) is then proved. Furthermore, by the continuity of the 


solution to (3.20) with respect to the control parameter Л, one has 


l.So (17) — plo — 0  asn — оо. (3.27) 
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It follows from (3.27) that, for any R > 0 
W (У € Bw, R)) > W (У є B(So(h"), R/2)) (3.28) 


if n is large enough. In the first part of the proof, we have shown that the weak lower 
bound holds for W (Y* є B(So(h"), R/2)); then, taking the limits as = — 0 and 
К — 0 in (3.28), one has 


lim inf lim inf e? log W (Y* € В, R)) > —Lr(So(h")) for every n € N. 


Since Zr (So(h")) = 4 i ("yi 5 fo (h)2dt = Zr (Y), the bound (3.18) fol- 
lows. Finally, a standard density argument of C l ([0, 1]) functions in С([0, 1]) allows 
to extend the claim to any v € Qso such that Zr (0) < +оо. п 


Remark 3.12 Ina classical situation, the claim would be the lower bound (3.17) fora 
process X* satisfying, say, dX^ = b-(X*) + eo (X*)dB with Lipschitz coefficients с 
and рг — bo, and Хо = x^ — x. In this setting, fixing a control h є Н and defining 
X*^ from Х by shifting the Brownian motion B as in (3.22), it is straightforward (in 
fact: an application of Gronwall’s Lemma) to show that X^" converges in law to the 
unique solution of the deterministic limit equation dy = bo(y)dt+ с(р)аћ, po = x. 
In the present (degenerate) situation, the deterministic limit equation for the process 
XE} (obtained setting = = 0 in (3.22)) coincides with the ODE (1.2) which admits 
infinitely many solutions. When circumventing this problem by passing through the 
transformed process У", we actually show that the convergence in law of X*^ to 
a particular solution ф* of the limiting equation is restored. Indeed, assume as in 
Proposition 3.10 that л is such that the unique solution ~ of the well-posed equation 
(3.20) with y = 0 is positive for every t > 0, and У converges in law to 1. The 
function 0 is easily computed, namely v; = a(1 — y)e^0—?* h e P0—»55. ds. By 


ih. 1 

definition, one has X^^ = (У^) 177 Eun wry =: v*. By direct computation, q* 
is absolutely continuous and such that  — 0 and ф* = By* + o(y*)7h, hence * 
is a solution to (1.2); in particular, 


1 р А DL 
gr: e& (s — 9) n e BONS), qs)" E (3.29) 


Therefore, in the small noise limit, the stochastic dynamics (3.22) performs a selec- 
tion among the solutions of the limiting deterministic system (1.2), selecting the 
strictly positive one, ‹*. This looks reasonable in light of the fact that, though con- 
verging to zero, the drift parameter o^ and the initial condition x^ of the process 
remain strictly positive for all = > 0.? Figure 1 shows the convergence of simulated 


5By perturbing the initial condition and the drift in (1.2), one can retrieve the trajectory * in (3.29) 
as the limit as p — 0 of the solution of the equation dy, = p + Вр + ov) dh, po = p, for 
which existence and uniqueness hold. 
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—— =0.25 
— в=0.20 
—— в=0.15 
—— &z-0.10 
— «=0.05 


Out[8]- 


— yx 


— Discarded degenerate- ODE solution 


1 2 3 4 3 


Fig. 1 An illustration of the convergence of the process X^" in (3.22) to a particular solution * of 
the limiting deterministic sytem. Trajectories have been simulated for different values of the noise 
parameter = and y = 1/2, a(x) =1,8=0,0=2,h=1,x =0 


trajectories of the process Х to y* in (3.29) as = — 0), for a given choice of the 
control parameter Л. 


Remark 3.13 (Lower bound from the upper bound) In general, the weak conver- 
gence of the controlled process X^^ can be shown exploiting the large deviation 
upper bound. This goes as follows: in the notation of Remark 3.12, assume X* sat- 
isfies dX* = b^ (X^) + eo (X*)dB with Lipschitz coefficients, and define X^" from 
X* as in (3.22). Assume one has proven a large deviation upper bound analogous to 
(3.12) for the process X^", with a good rate function 7” depending on the control 


i E TM . x 2 . . 
parameter h, I^ (Y) := 1 I (cmm) dt. It is clear that /" admits as a 


o(p) 
unique zero the solution (Л) of wy = Бо(0;) + o Gb). Using the compactness of 
the level sets of 7” and the large deviation upper bound, it is easy to conclude that 


lim W (x # B(o(h), R)) =0 YR >0, 
e>0 


hence X^^ — (й) in law. This provides a way of “bootstrapping” the large devi- 
ation lower bound from the upper bound (via weak convergence, together with the 
bound on relative entropy in Lemma 3.11). When the limit ODE has several solu- 
tions, this approach is not possible anymore: in the present case, the rate func- 
tion 1” (y) = 1 T (=) 10, 0) has uncountably many zeroes, cor- 
responding to the possible solutions of the degenerate ODE (1.2). While one is 
expecting that converging subsequences of the family of measures {У(Х є -)}e 
converge to a probability distribution supported by the set of solutions, it is not obvi- 
ous a priori how to restore a unique limit for X^^ (which is why we pass through 
the transformed process Y ^^). When uniqueness for the limiting equation is granted, 
such an approach remains efficient, and applies outside the Markovian framework 
(see [8] for a treatment of delayed equations. In the setting of [8], uniqueness of solu- 
tions for the deterministic sytem is essential, and enters via their condition (H4)). 
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3.4 Proof of Tail Estimates 


In this section, we prove the asymptotic estimates that have been stated in Sect. 2.1 
and that follow from Theorem 2.1. 


Proof of Proposition 2.5 Setting = := R7079 into (2.1), one has 


lim sup 720—7 log W (Хт > R) = lim supe’ log W (Xj > 1) <—Р 


R—-Foo e>0 


where 


P = inf {Ir (p) : po = 0, > 0, pr = 1) 
= inf inf (Ir (9) : po = 0, > 0, pr = у} =: inf P(y). 
у>1 у>1 


Fix y > | and a function ọ in the admissible set of P (y), such that Zr (р) < oo. Set 
Ypi = er. On (o = 0], one has у = 0 as well, while for a point ¢ in the open set 
{о > 0} such that v; exists, one has v; = (1 — Der. Then, noting that /r (v) < оо 
implies that 2 lazo is integrable on [0, Т], v» is also absolutely continuous on 
[0, Е (see [20, Corollary 3.41]). Moreover, /т (v) = 51; d. (eoe сазы lo, -odt = 


Ф; 
5 a 2L m (b B(1 NY lu, ой. Noting that the inverse transformation y = 


yY a= =? also maps AC positive functions to AC positive functions (as qu > 1), one 
has 


1 T us 
Р(у) = бй үй "| f (Y — Bg(1— Yi) 1y,>0dt : w is abs. cont., 
bo = 0,0 > 0, фт = | 


When 8 = 0, the minimizer of this problem is %* (y) = y!~7t/T. When B # 0, the 
solution of the Euler-Lagrange equation associated with the Lagrangian (Y — G(1 — 
yv)? and the boundary conditions wo = 0, Yr = y!~7 yields the minimizer 


y 


* E bayt _ „—8(1—)ї 
РО = ут e Nr ý h 


In both cases, V7 (y) > 0 for all t є (0, T], and the positivity constraint in P(y) 

can be dropped. Using the monotonicity of i/* w.r.t. y, this yields infj-1 Р(у) = 
1 T 2 -— 

P(1) = eae i (t) — 8(1 — )0%(1)) dt. An application of the large 

deviation lower bound (2.1) gives lim іпЁр_> +оо 0-0 logW (Хт > А) = 

lim inf -_,9 22 log W (X5. > 1) = —infy.; P(y) = —Р(1). Finally, the explicit 
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evaluation of the integral in P(1) over the function w/* yields the expression of the 
constant ст in (2.4). 

Let us consider the running maximum process. Another application of the large 
deviation principle (2.1) with = = R-(—? gives 


lim inf 720-7) log w( sup X; > к) > —ст 
К +оо te[0, T] 


where ст inf (Ir (p) : фо = 0, ф = O,sup,eo r] 9r > 1}. Since W (sup;eo. т] 
X; > R) > W(X; > К) for every f < T, one has ст < infrejo,T] c; = CT, 
where the last identity holds for c; is a decreasing function of т. On the other hand, 
lim SUPR +00 820—7) log W (sup;epo. rj Х,> R) < cr i= —inf{Ir (p) : po = 


0, 7 0, sup;iep, T] Yr = 1}. Since 
Tr = inf [Ir (р): ро= 0, р> 0, sp р = 1, pr > o} 
te[0,T] 
a i inf(/; (9) : Ф is abs. cont. on[0, t], до = 0, Ф = 0,9; = 1} 
telu, 


IV 


= inf с; = ст 
te[0, T] 


one has ст = Cr = ст, and the claim is proved. [| 

As addressed in Sect. 2.1, Theorem 2.1 can also be used to obtain the leading- 
order asymptotics for the distribution of the time average of the process. Such a result 
can be used to derive the leading-order behavior of the implied volatility of Asian 
options E [(2 I4 Х,а — K) *] for large strike К. 


Proposition 3.14 Estimate (1.5) in Theorem 1.2 holds with ит > 0. When y = 1/2, 
the constant ит is given by 


(782+ 5E) ут8/2<1 


, (3.30) 
(782 - 3g) ут8/2>1 


ит = 


where 


the w € (0, п) such that ш cosw = T 8/2 sin(w) ifTB/2 «1 

ш= |0 Т8/2 = 1 
the w € (0, оо) such that ш cosh(w) = T 8/2sinh(w) if TB(1— y) > 1. 
(3.31) 
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Remark 3.15 Following the lines of the proof of Proposition 3.14, one can prove 


the analogous asymptotic relation for a general time-average functional i, X,p(dt), 
where u is a bounded signed measure on [0, Т]. One gets 


T 
Ww (/ X, (dt) > к) =e 077 От) аб R> 00, 
0 


where Yr is characterised by the variational formula Vr :— inf | Iro) : I NT 
(dt) > 1, > 0, Vt € [0, rj). 


Proof of Proposition 3.14 An application of th large deviation principle (2.1) 
with = := RTC? yields lim supp_, 4o 720—7 log W (4 fo Х,а > R) = 
lim sup. ,o£?log W(4 f] Xfdt > 1) < —vr, with vr = inf{Ir (р) : yo = 
0, р> 0, + [4 prdt > 1}. Proceeding as in the proof of Proposition 2.5, and in par- 
ticular exploiting the endomorphism of AC([0, T], R+) o > v = Ф!" together 
with the chain rule  — /q?1,.0, one has 


T 
VT = int| 1 (Y): yo = 0, > 0, 7], ой > | 
Т . 1. А 
Е zar] (Yri — BU — Yri) dt : Yo = 0, 
1 
veo, f byt? dt > 1 
0 
tug | 
T (coro TEA Dorr) dt : ijo = 0, 
1 
vzo, f yaz ] 
0 
1 1 . 1 . 2 
= int aint | f (^ - TBA – 0%) dt: фо = 0, 


1 
ф> 0. | gat = n| =: inf J(n). 
0 n=l 


1 ) 
Tau cs 


When y = 1/2, the latter variational problem was studied in [12, Exercise 
2.1.13]. The explicit solution for J provides the expression of the constant vr = 
inf,>1 J(]) = J(1) given in (3.30). The large deviation lower bound yields 
lim inf p+400 RO- log W(4 [у X,dt > R) = liminf 2,09€? log W(1 fy x; 
dt > 1) > —J(1) = nr, and the claim is proved. 

Consistency check with the explicit formulae for the integrated CIR process. 
Let us consider the case y = 1/2, and compare Proposition 3.14 with the moment 
explosion of the integrated CIR process, corresponding to a(x) = о > 0 in con- 
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dition (H2). We focus on the (common) case of a mean-reverting drift, i.e. 9. < 0; 
computations for 8 > 0 are similar. Estimate (1.5) establishes that I d X;dt has 
finite exponential moments up to order v7: more precisely, 


и [Т 1 [T 
u* := sup(u > 0: [exo(= / х,а) | « oo) = зир{ > 0: (2 / Х,а > х) 
Т 0 Т 0 


О(е х) as x > oo] = ит 
(3.32) 


(for the central identity, see for example [15, Sect. 4]); in other words, vr is the posi- 
tive critical exponent of T i Х,а. Critical exponents for integrated CIR have been 
assessed by [2, 14, 18] relying (essentially) on the affine structure of the process. It 
is typical to obtain u* by inverting an explicit explosion time: following [2, Corollary 
3.3], Elexp(# i. X;dr)] is always finite if u < T3?/(207), and if u > T?/(20?), 
the expectation is finite for T < T*(u) and infinite for T > T*(u), where T* reads 


т + arctan (=) 


T*(u)=2 


, 


(и) 


where y(u) = ,/20? T G2. Fixing T and using the monotonicity of T*, this means 
that the expectation becomes infinite for u > u* with u* the solution to 


192) = уш) (3.33) 


arctan 
T+ ( 2 


As an equation in ^, it is easy to see that (3.33) has a unique root 7* on К+ such that 
Y € (5, п). From the definition of y, 


и* = тё +TP) = (тё + n Е = B + 20у) 


М 
setting w* = TY. From (3.33), w* is the unique solution to w = 7 + arctan (74). 


which is equivalent to tan(w) = E together with w € G- 7): one sees that this 


definition coincides with the one for w in (3.31) (noticing we are in the first case 
when 8 < 0). 
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Appendix A 


We complete the proof of Proposition 3.2 here. 


Proof of Proposition 3.2 Let us define an auxiliary process X by 
E = So, 
dX, = |010 + o exp(-(1 — 7)1810)Х,а4В;, Xo — x; 


after a simple application of the product rule, one has that the process 4; :— 
exp(|B|t) X, is a solution to 


AZ, = (lalo ехр(1610) + I81Z:)dt + oZ? dBi, Zo = x. 


Since |о|оо exp([8|t) > lalo, an application of the comparison principle for SDE’s 
[17, Proposition 5.2.18] yields 7, > X;, for al t > 0. Therefore, if X ~” admits 
(some) exponential moments, so does gs ? and by comparison X; X207 [p this 
sense, the process X is not covered by Proposition 3.3 in [9], since the latter deals 
with the case of a diffusion coefficient that does not depend on time (see [9, Eq. 
(3.1)]); nonetheless, the essential condition that [9, Proposition 3.3] relies on is the 
presence of a non-strictly positive slope coefficient, say b in the drift term a + bX 
(cf. [9, Eq. (3.3)]). Since this is the case for the process X (which has zero slope 
coefficient b), it is straightforward to extend the proof to the present setting: in 
particular, in the spirit of Lamperti's change-of-variable argument, one still defines 
the function ф(х) = Jo a = sue 7 and studies the process Ф(Х,), where 
the function is a modification of y identically null around zero. Itó's formula 
shows that (X,+) is an Иб process with bounded quadratic variation and a bounded 
drift term; the existence of quadratic exponential moments for Ф(Х,), then, is a 
consequence of Dubins-Schwarz time-change argument and Fernique's theorem. 


; -2(1— 
As a consequence, there exist с’, С > 0 such that ѕир, <т É [exp(c’X, у < С; 


it follows ѕир, „т E F[exp(cX2 ШҮ < supper [ехр(с220- Y)) < C with c := 
с! exp( —2/8|(1 — y) T), and the claim is proved. ш 
We report the statement given in [26, Chap. 2, Theorem 2.13]. 


Lemma A.1 (Garsia-Rodemich-Rumsey's Lemma) Let p and Y be continu- 
ous, strictly increasing functions on [0, +оо) such that p(0) = (0) = 0 and 
Пт, +оо V(t) = +оо. If v € © is such that: 


LE (51 =] asas К, (A.D) 
pt — s|) 


|t—s| AK 
wr — wsl < ғ | y^ (=) dp). (А2) 
0 


then 
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Lemma A.1 allows us to prove Proposition 3.3: 


Proof of Proposition 3.3 Assume that (3.3) holds true with the left hand side replaced 
by K > 0. Applying Lemma A.1 with the choice of functions V (y) = exp(e? y) al, 
p(y) = Jy, one has for all s, t 


ko, —из| < 8 CU SE eam Мате (ЕУ 
= Ws| < A a | ар(и) = 8c o 098 qz +1Јар(и) 


it-s 
< 822 n log (4x $ т?) dp(u) 
0 


it-s 
+ f log (u?) ро 

< 8e? | iir зов (4K + т?) 
= s] | 


Dividing on both sides by (t — s)” and taking suprema we obtain 
1, < ве2 (log (ак + T?) T= 4 47127 + x). 


Since the right hand side in the last estimate is Kz} (K), (3.3) yields (3.4). п 
Finally, we prove Lemma 3.10. 


Proof of Lemma 3.10 Denote 7* the stopping time 
T : 1 ye 
T° (w) = inf {t > 0: ш < 5°Х Yi. (A.3) 


We can apply Itó formula to the function f(x) = x!—? up to time T*^(Y^^), and 
obtain 


t 
yeh _ gyl-t = BYE" аз + o(1 — yh; + eol — В, үг < Т(Ү"), as. 
0 


(A.4) 
where b® is given by 


" І 1 1 1 oyl- y) 51 
be(y) := U-ye™a(e 07» y 17») = d 5 E 2 5+Ва yy (А.5) 
y = 
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We need to prove 


lim W( sup IY?^ — Soh), R|21 VR>O. (A.6) 
e>0 te[0,T] 


In order to simplify the notation, there is no ambiguity in writing Y instead of Y®” 
inside this proof. 
Step 1. We first prove (A.6) under the assumption 


k:= inf h 20 (A.7) 
te[0, 7] 
Let us fist show that 
lim W (7° (Р) < т) =0 (A.8) 
£— 


A direct computation shows that there exist a constant c > 0 depending on x, c, a(-) 

such that: _ 

inf {6°(y) — BU - py} > се. (A.9) 
xii 


у> РА 
у256х 


Define (Z;):eq0,7] by 


t 
Z = exl? + (—сє + 0(1 – y)k)t + B(1— »[ Zsds + co(l — y) B, (A.10) 
0 


Using (A.9), it follows from the comparison principle for SDEs that 
Ү,> Z, Vt € T'(Y), as. (A.11) 


We claim that 
У(Т°(7)<Т)— 0 (А.12) 


holds true. Since W (T* (Y) x T) < W (T* (Z) x T) by (A.11), then (A.8) holds. 


We prove (A.12) later on. Now, it follows from the definition of So(A); and an 
application of Gronwall's Lemma that 


IY, — Soh); | < (e c(l— y) sup Ip, ener =:Өт VtzT*(Y), 
te[0, 7] 


therefore, for any А > 0 and = small enough 


IV 


v( sup |Y; — Soil < к) 
te 


"(| sup | |Y; — So(h)s| < өг) n(es < к) 
[0,7%] tel 


0,7 Qn] 
W ((r*() = T}N {OF x np). 


IV 
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Since both the events in the right hand side of the last inequality have probability 
converging to 1, (A.6) follows, and Lemma 3.10 is proved under condition (A.7). 

Step 2. We assume that (A.7) holds only on the time interval [0, o], that is h ‚>К 
for every t < p, for some К, p > 0. Repeating the argument of Step 1 with T = p, 
we have 


lim w( sup |Y; — So(h);| < к) =1, VR >0 (А.13) 
=—0 te[0,p] 


We apply estimate (A.13) together with a localization argument. Define a time- 
shift operator то, for every w є ©, by (тш), = шры for all t € [0, T — р]. For 
any fixed у > 0, denote X>”? the strong solution of the SDE: 


; EUN : ТТЕ СТРА 
ХУ? = упт «f БХ?) +оо во | IX? ^l ав, 
0 0 


and set 
y?^?:- (Х2?)1-7. 


Note that Y^"? is well defined since ХУ? > 0 for all t € [0, T], W-almost surely. 
If h = 0 the non negativity of the trajectories of ХУ? follows from an application 
Proposition 3.1 in [9] and extends to h € H by an application of the Girsanov 
theorem. By definition of Y and Y "^^, the Markov property yields 


E(f (tp¥)|Fp) = ECF (Уу) 


By the continuity of the map (Л, y) — $; (л) we can choose R’ > 0 such that 


sup sup |Sy (rof); — <), (то): < 
yEB(So(h)p.R’) te[0.T —p] 


s A.14 
= (А 


Therefore, using (А .14) the following inclusion of events holds (assume w.lo.g R’ < 
R 
2 


sup |Y; —So(h);| < d 2 E |, = So(h);| < 1 
te[0, T] [0.p] 


R 
N | sup |7,(Y); — Sy, (тоћ) | < | 
te[0.T — p] 
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Applying the Markov property 


w( sup |Y; — So(h);| < к) (А.15) 
te[0, T] 


Үр,р R 
ze Li sup;<to,p) IY, -SoQ),| zn) W sup |Y," — Sy, (roh);| = 2 
te[0,T —p] 


IV 


w( sup |Y; — So(h):| < к) 
1є[0,р] 


; R 
x inf w( sup |X?’ — Sy (Toh) < 2) 
yEB(So(h)p,R’) te[0,T — p] | 2 


We want to show that 


; R 
lim inf W{ sup |02 – (7л) < j=l (А.16) 
£20 yeB(SoU),,R) — Vrepo,T—gl 2 


It follows from the hypothesis So(h),; > 0 Vt > 0 and the continuity of the map 
(у, h) к> Sy(h) that, if R’, А are small enough 


R 
* = inf inf S,(7,h), -— > 0. T 
y yeB(So(),,. R^) tel0. T—p] у(трћ), 3 ( 


Define U^? as the unique strong solution of the SDE: 
yp gr 
U;" =y «f (6002?) + о(1 — Yhs+p)ds + eo(1 — 3) Bs, 
0 


where 


b* (y) ify > y* 
№0) = І c^y0-) 2 1 
2 € y* 


1 2] 1 
BAd-yyt+d-yelVale Q7» (у*) 0-7) 7 ify « y*. 
(y*) 1—y 


Then one has 


y.p R y.p R 
W sup |Y;" — Sy Tohi < | = № sup |U;" — Sy Toh) € = |. 
te[0, T — p] 2 te[0, T — p] І 2 
(А.18) 
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Now observing that b" is globally Lipschitz continuous Ve > 0 and С° := 
supyeR |b, (y) — GU — ¥)y| — 0, an application of Gronwall’s lemma gives 


( sup |U7^— S, Gul) < (C*T + 2eo(1 — y) | T) exp(|B(1 — VIT). 
te[0, T —p] 


(A.19) 


By letting = — 0 and applying the Markov inequality, observing that the right hand 
side of (A.19) does not depend on y, we have proven (A.16). By letting = — 0 in 
(A.15) and applying (A.13) and (A.16), the proof of Lemma 3.10 is complete. BI 


Proof of (A.12). Observe that Z:- 17 is an Ornstein-Uhlenbeck process, 


t 
Z, = x17 + et + BU — 9) f ЎА + o(1 - B, (А20) 
0 


where ue :— 1(—сє +0(1-– У)Ю = —c + 202) Ttis immediate by the definition 


€ 
y 


of Z that W(T'(Z) < T) = W (infe. 7 < =”). The explicit representation 


2 
of Z reads 


t 
Zp = xl 16907! +. F(t) + o(1 — y) ехр(8(1 — ү) f ехр(—8(1 — 5)5)dB, 
0 
(A.21) 


with f-(t) = —£ ot Ge) Consider a deterministic time т; with т: > 0 as 


€ — 0), to be chosen precisely later on. Noting that f- is a decreasing function, for 
Te < t € T опе has 


(A.22) 


E t 
2, = felts) cd =) f eoa - 054, 


hence, using Markov’s inequality and Doob’s inequality 


t 
v( inf zs) <»( sup «a - |f exp(—G(1 — y)s)dB, 
te te 0 


[Te,T] [7-,T] 
> felts) — su) 


< Co - » (to - x72) 


T 7 
x (/ exp(—20(1 — Das) ; 
0 
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Now, the choice Te = 4/e gives f:(r-) ^ jT: — оо as = — 0, so that 
(fe (т) — ea) — 0. On the other hand, іпЁ, єго, т.] 7, > x! a.s. as € — 0, 


hence W (int; et... 2, < x/2) — 0 аѕ = — 0, and the claim is proven. 
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Long Time Asymptotics for Optimal 
Investment 


Huyén Pham 


Abstract This survey reviews portfolio selection problem for long-term horizon. 
We consider two objectives: (1) maximize the probability for outperforming a target 
growth rate of wealth process (ii) minimize the probability of falling below a target 
growth rate. We study the asymptotic behavior of these criteria formulated as large 
deviations control problems, that we solve by duality method leading to ergodic 
risk-sensitive portfolio optimization problems. Special emphasis is placed on linear 
factor models where explicit solutions are obtained. 


Keywords Long-term investment - Large deviations * Risk-sensitive control + 
Ergodic HJB equation * Risk-sensitive control problems * Hamilton-Jacobi-Bellman 
equations * Large-time asymptotic * Large deviations 


MSC Classification (2000) 60F10 · 91G10 - 93E20 


1 Introduction 


Dynamic portfolio selection looks for strategies maximizing some performance cri- 
terion. It is a main topic in mathematical finance, first solved in continuous time in 
the seminal paper [13], and extended in various directions by taking into account 
stochastic investment opportunities, market imperfections and/or transaction costs. 
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We refer for instance to the textbooks [10, 11] or [19], and the recent survey paper 
[12] for developments on this subject. 

Classical criterion for investment decision is the expected utility maximization 
from terminal wealth, which requires to specify on one hand the utility function 
representing the investor's preference, and subjective by nature, and on the other 
hand the finite horizon. We consider in this paper an alternative behavioral founda- 
tion, with an objective criterion over long term. More precisely, we are concerned 
with the performance of a portfolio relative to a given target, and are interested in 
maximizing (resp. minimizing) the probability to outperform (resp. to fall below) 
a target growth rate when time horizon goes to infinity. Such criterion, formulated 
as a large deviations portfolio optimization problem, has been proposed by [22] in 
a static framework, studied in a continuous-time framework for the maximization 
of upside chance probability by [17], and then by [9], see also [21] in discrete-time 
models and [18] for a survey paper. The asymptotics of minimizing the downside 
risk probability is studied in [8, 15]. 

Large deviations portfolio optimization is a nonstandard stochastic control prob- 
lem, and is tackled by duality approach. The dual control problem is an ergodic risk- 
sensitive portfolio optimization problem studied in [6] by dynamic programming 
PDE methods in a Markovian setting, see also [7], and leads to particularly tractable 
results with time-homogenous policies. A nice feature of the duality approach is also 
to relate the target level in the objective probability of upside chance maximization 
or downside risk minimization to the subjective degree of risk aversion, hence to 
make endogenous the utility function of the investor. 

The rest of this paper is organized as follows. Section2 formulates the large devi- 
ations criterion. In Sect. 3, we state the general duality relation for the large devi- 
ations optimization problem, both for the upside chance probability maximization 
and downside risk minimization. We illustrate in Sect. 4 our results in the Black- 
Scholes toy model with constant proportion portfolio. Finally, we consider in Sect. 5 
a factor model for assets price, and characterize the optimal strategy of the large 
deviations optimization problem via the resolution of an ergodic Hamilton-Jacobi- 
Bellman equation from the risk-sensitive dual control. Explicit solutions are provided 
in the linear Gaussian factor model. 


2 Large Deviations Criterion 


We study a portfolio choice criterion, which is preferences-free, i.e. objective, and 
horizon-free, i.e. over long term investment. This is formulated as a large deviations 
criterion that we now describe in an abstract set-up. On a filtered probability space 
(О, F, F = (F;)r>0, P) supporting all the random quantities appearing in the sequel, 
we consider a frictionless financial market with d assets of positive price process $ = 
(S1, ..., 54). There is an agent investing at any time t a fraction 7; of her wealth in the 
assets based on the available information F;. We denote by A the set of admissible 
control strategies m = (7;);>0, and X^ the associated positive wealth process of 
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dynamics: 

ах" = Х"п!діав(8;) !d$,, t > 0, (2.1) 
where diag(S;)-! denotes the diagonal d x d matrix of i-th diagonal term 1/ Si. 
We then define the so-called growth rate portfolio, i.e. the logarithm of the wealth 
process X”: 


Li :=ln Xf, t20. 


We set by L” the average growth rate portfolio over time: 


We shall then consider two problems on the long time asymptotics for the average 
growth rate: 


(i) Upside chance probability: given a target growth rate £, the agent wants to 
maximize over portfolio strategies 7 € A 


PP[L7. > £] when T — œ. 


(ii) Downside risk probability: given a target growth rate £, the agent wants to 
minimize over portfolio strategies 7 € A 


PP[L7. < £] when T — œ. 
Actually, when horizon time T goes to infinity, the probabilities of upside chance 


or downside risk have typically an exponential decay in time, and we are led to the 
following mathematical formulations of large deviations criterion: 


1 - 
v4 (£) := sup lim sup — In P[L7 > £], (2.2) 
тєА T—oo 
v_(£) :— inf lim inf = In P[Z x £] (2.3) 
Е 6А Too T facul ` 


This criterion depends on the objective probability P, and the target growth rate £, but 
there is no exogenous utility function, and finite horizon. Large deviations control 
problem (2.2) and (2.3) are nonstandard in the literature on stochastic control, and 
we shall study these problems by a duality approach. 
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3 Duality 


We derive in this section the dual formulation of the large deviations criterion intro- 
duced in (2.2) and (2.3). Given т € А, if the average growth rate portfolio ТЛ, satisfies 
a large deviations principle, then large deviations theory states that its rate function 
I (., т) should be related to its limiting log-Laplace transform T (., т) by duality via 
the Gartner-Ellis theorem: 


I(£, т) = sup [0€ — T (6, т)], (3.1) 
0 
where J (., т) is the rate function associated to the LDP of Г: 
limsup I Р|І7 > e] = go = I(l, r), L> im, EE, (32) 


and T (., т) is the limiting log-Laplace transform of L7: 


1 тт 
Г(Ө,т) := lim sup T In Je mh OER, 


T—oo 


The issue is now to extend this duality relation (3.1) when optimizing over control 
7. 'To fix the ideas, let us formally derive from (3.1) and (3.2) the maximization of 
upside chance probability. 


1 Е 
sup lim sup T In P[LẸF > £] 


"T T=>œ 


sup [ — I(£, т)] 


= sup | — sup [04 — Г(0, 7] 
T 0 
= inf |T (0, 7) — 0€ 
supin [T (6. т) — 00] 
(if we can invert sup and inf) — i | sup Г(0, т) — oe]. 
T 
We thus expect that 
v+ (£) = inf [A+0) — 0t], (3.3) 

where A. is defined by 


A4 (0) = supT (0, 7). 


In other words, we should have a duality relation between the value function v+ of the 
large deviations control problem, and the value function A+, which is known in the 
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mathematical finance literature, as an ergodic risk-sensitive portfolio optimization 
problem. 

Let us now state rigorously the duality relation in an abstract (model-free) set- 
ting. We first consider the upside chance large deviations probability, and define the 
corresponding dual control problem: 


1 тт 
A+(@) := sup lim sup = In Ще], 0 > 0. (3.4) 


T€A T—oo 


We easily see from Hólder inequality that A. is convex on Кү. The following result 
is due to [17]. 


Theorem 3.1 Suppose that A is finite and differentiable on (0, 0) for some 0 є 
(0, co], and there exists 7(0) є A solution to A4 (0) for any 0 € (0, 0). Then, for all 
£ < A‘ (0), we have: 


0.00) = inf [A4(0) — 00]. 
0€[0,0) 


Moreover, an optimal control for 0+ (£), when € € (Л! (0), AY (0)), is 
mt’ 2d4(0(0), with A',(6(0) = £ 
while a nearly-optimal control for 0+ (£) = 0, when £ < A’, (0), is: 


noo 


1 
a = $(0,), with 6, = 0 (s. + =) — 0, 
n 


in the sense that 


1 = n 
lim lim sup T In P[LT' > £| = v4 (8). 


n— oo T=% 


Proof Step І: Let us consider the Fenchel-Legendre transform of the convex function 
A+ on [0, 0): 


Aš (£) = sup [£ - A4(0), LER. (3.5) 
0c[0,0) 


Since Л + is C! on (0, 0), it is well-known (see e.g. Lemma 2.3.9 in [4]) that the 
function A is convex, nondecreasing and satisfies: 


0(£)£ — A+ (0(0)), if Л (0) < £ < A‘, (8) 


* = 
Ae) = 0, if £ < A’, (0), 


(3.6) 
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0() — AX (€) > OOL — АЎ (Р), УА! (0) << Д, (0), VE Ae, (3.7) 


where 0(£) € (0,0) is s.t. A, (0(0) = £ € (A4 (0), Л" (0)). Moreover, A% is 
continuous on (—оо, А”, (0)). 
Step 2: Upper bound. For all £ € К, т € A, an application of Chebycheff's inequality 
yields: 


P[L7. > €] < exp(-0£T)E[exp(0T L7)], V0 [0, 0), 


and so 


1 " 1 z - 
lim sup — In P[L7. > £] < —0¢ + lim sup — In E[exp(0T L7)], V 0 e [0, 0). 
р Т р Т р Т 


T—oo T—oo 
By definitions of A4 and A5, we deduce: 


1 = 
sup lim sup — In P[L7. > £] € Л (0). (3.8) 


тєА T—oo 


Step 3: Lower bound. Consider first the case £ € (A, (0), A‘, (0)), and let us define 
the probability measure Q7 on (©, Fr) via: 


dQr _ пъ +, 
LE lec; -Tr (60), т j|. (3.9) 


where 


Гт(0, л) = InE[exp(0TL7)], 0€[0,0), тє A. 
For any є > 0, we have: 


1 a tie 1 dP 
ple == < [т <t+el]= Th (/ 39; [ceti gr) 


-0O (t) + Tr (CO, ^) 


IV 


1 = +e 
+ = InQr[t—e <" « £4 e]. 


where we use (3.9) in the last inequality. By definition of the dual problem, this 
yields: 


"TEE ы 
lim inf plese s Ly « £- €] 


T—oo 


> —0(@)(# + £) + A400) 
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1 = te 
+ lim inf > In От LEE « in < te] 
> Л (0) – Oe 


1 -— 
+ lim inf T In Qr [ —є< LU ct e| ; (3.10) 
= OO 


where the second inequality follows by the definition of A* (and actually holds with 
equality due to (3.6)). We now show that: 


1 ы 
lim inf — In Qr [e-e < EF” <tte = 0. (3.11) 
Too T 


Denote by PT the c.g.f. under Q7 of a. For all ¢ € R, we have by (3.9): 


i'r (O := InE9r[exp(CL7 )] 
= Гт(000) +С, 0*5) — Tr), at). 


Therefore, by definition of the dual control problem (3.4), we have for all ¢ € 
[—0(0), 0 — A(€)): 


1- 
ps sup FTT (O < AOL +6) — A+ (60). (3.12) 


As in part (1) of this proof, by Chebycheff’s inequality, we have for all С € [0, 0 — 
000)): 


lim sup 2 In Qr Ea 4r e| < —C(E + £) + lim sup lio 
Т- оо Т T— œ T 
< —6 (E+E) TAG 0E) — A LED, 


where the second inequality follows from (3.12). We deduce 


1 zal - 
lim sup = In От [£7 > £+ e| < = supt € - ^O : є 1000,0) 


T—oo 

— A4 (0(0£)) +000) (£ +) 
—A* (Ё +=) — A4(0(0)) + 00) (+ =), 
= —А* (+£) + А* (0) + =0(0), (3.13) 


ІЛ 
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where the second inequality and the last equality follow from (3.6). Similarly, we 
have for all ¢ € [—0(£), 0]: 


1 PT, ie 
lim sup T In Qr Lx < {— e| < —Ç (£ — £) + lim sup т17© 


T—oo T—oo 


ІЛ 


=C (£ — €) + A+ (60) + C) — A+ (000)), 


and so: 


1 zou 
lim sup = In Qr Lg" <ё—] < —зир{6 € - 9 - A«O : € € [0, 6cOD 


T—oo 
—ASdQU)) + 90) (# — е) 
= Аў (£— =) + A3 (0(0)) — €0(£). (3.14) 


ІЛ 


By (3.13) and (3.14), we then get: 


1 Sach, - +e 
lim sup = In Qr Ца" x t- e] U [z^ >£+e]] 


T—oo 


1 = + 1 es 

< max {tim sup —InQr [er mé. d ; lim sup — In Qr [er < {— 3l 
T—oo T T—oo T 

< max (- A5 (£+ e€) + Л (0) + 2000); — A5 (£ e) + АЎ (0(0) — =000)} 


« 0, 


where the strict inequality follows from (3.7). This implies that От [Er ^ < {—с} 


U iz > £+ £}] > 0 апа hence Qz[£ — = < m < L+ €] — las T goes to 
infinity. In particular (3.11) is satisfied, and by sending = to zero in (3.10), we get for 
any £ < £ < A! (0): 


MEETS 1 ane 
lim inf = арт > £l lim lim inf = InP[¢ — € < Le cte 
— 00 E> — оо 


> —A* (0). 


By continuity of AX on (—оо, A (0), we obtain 


ОО! = te á 
lim inf F Һрт > 42 А5 (0). 


This last inequality combined with (3.8) proves the assertion for v. (£) when £ € 
CA^ (0), A^, (0)). 

Now, consider the case £ < A‘, (0), and define £, = A‘, (0) +1, 0+ = 4 (0(0,)). 
Then, by the same arguments as in (3.10) with £, € (Л (0), AY. (0)), we have 
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1 = n 1 = n 
lim inf 7. In PILE” > | > lim lim inf = In P[£, — € < Ly <b, +] 


EA US). 
By sending n to infinity, together with the continuity of A5, we get 


К К " 5 1 EXE 
lim inf lim inf 2 Р | -£]z —AS(A,(0) = 0, 


noo Т—›со 


which combined with (3.8), ends the proof. 


Remark 3.1 Theorem 3.1 shows that the upside chance large deviations control prob- 
lem can be solved via the resolution of the dual control problem. When the target 
growth rate level £ is smaller than A’, (0), then one can achieve almost surely over 
long term an average growth term above £, in the sense that v+ (£) = 0, with a nearly 
optimal portfolio strategy which does not depend on this level. When the target level 
£ lies between A‘, (0) and A‘, (8), the optimal strategy depends on this level and is 
obtained from the optimal strategy for the dual control problem A ү (0) at point 0 = 
0(£). When Al (0) = oo, i.e. A4 is steep, we have a complete resolution of the large 
deviations control problem for all values of £. Otherwise, the problem remains open 
for £ > Al, (0). 


Let us next consider the downside risk probability, and define the corresponding 
dual control problem: 


1 I" 
A-(0) := inf lim inf = In е 220, (3.15) 
T 


Т- оо 


Convexity of A_ is not so straightforward as for Л +, and requires the additional 
condition that the set of admissible controls A is convex. Indeed, under this condition, 
we observe from the dynamics (2.1) that a convex combination of wealth process is 
a wealth process. Thus, for any 01, 02 € (—оо, 0), А € (0, 1), 1,7? € A, there exists 
т € A such that: 


A04 1 (1 — А) 72 - 
T Хт = Хт. 
A01 + (1 — A902 A01 + (1 — A962 


By concavity of the logarithm function, we then obtain 


Е A01 л! (1 — A)05 т? 
In X7 > In X7. + In X7 , 
(A01 + (1 — A902) (Аб + (1 — 2902) 


and so, by setting 0 = А + (1 — А) < 0: 


OTZ < TLE + (1— XÓTLT. 
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Taking exponential and expectation on both sides of this relation, and using Hólder 
inequality, we get: 


1—A 


[её] < ( [еті Du "707 ]) 
l2 


Taking logarithm, dividing by Т, sending T to infinity, and since т^,“ are arbitrary 
in A, we obtain by definition of A. : 


A-(8) € AA-(01) + (d — АЛ (62), 


i.e. the convexity of A— on R_. Since Л _ (0) = 0, the convex function Л _ is either 
infinite on (—оо, 0) or finite on IR... We now state the duality relation for downside 
risk large deviations probability, whose proof can be found in [15]. 


Theorem 3.2 Suppose that Л _ is differentiable on (—оо, 0), and there exists Т (0) 
€ A solution to A_(0) for any 0 < 0. Then, for all ё < A' (0), we have: 


v.(£) = inf [A- (6) — 64], 


and an optimal control for v— (£), when £ € (A^ (оо), A^ (0) is: 
«^ = OWO), with A (0) = £, 


while v_(£) = —oo when £ < Л! (—oo). 


Remark 3.2 Theorem 3.2 shows that the downside risk large deviations control prob- 
lem can be solved via the resolution of the dual control problem. When the target 
growth rate level £ is smaller than A^ (—оо), then one can find a portfolio strategy 
so that the average growth term almost never fall below £ over the long term, in the 
sense that v. (£) = —oo. When the target level £ lies between Л’ (—оо) and A’ (0), 
the optimal strategy depends on this level and is obtained from the optimal strategy 
for the dual control problem Л _ (0) at point 0 = 0(4). 


Interpretation of the dual problem 
For 0 Æ 0, the dual problem can be written as 


1 
— А+ (0) = sup lim sup Jr (0, л), 
0 тЄА T—oo 
with 
Jr (0,7) := E eee | 
VO ӨТ | 
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andis known in the literature as a risk-sensitive control problem. A Taylor expansion 
around 0 = 0 puts in evidence the role played by the risk sensitivity parameter 0: 


Jr (0, т) = E[L7.] + OT Var(L7) + 00°). 


This relation shows that risk-sensitive control amounts to making dynamic the 
Markowitz problem: one maximizes the expected average growth rate subject to 
a constraint on its variance. Risk-sensitive portfolio criterion on finite horizon T has 
been studied in [2, 3], and in the ergodic case T — оо, by [6, 16]. 


Endogenous utility function 
Recalling that growth rate is the logarithm of wealth process, the duality relation for 
the upside large deviations probability means formally that for large horizon T': 


pee > £] = exp (v+ (OT) 
= exp(A4(0(0)T — OOT) 
= E| (xF O OT, with 00 > 0. 


Similarly, we have for the downside risk probability: 


= 6 


р[ 27 


<= Bl (xp) |007, with ec <0. 


In other words, the target growth rate level £ determines endogenously the risk 
aversion parameter 1 —0(£) of an agent with Constant Relative Risk Aversion (CRRA) 
utility function and large investment horizon. Moreover, the optimal strategy т^“ 
for v.£ (£) is expected to provide a good approximation for the solution to the CRRA 
utility maximization problem 


sup E[(x7)*9], 
TEA 


with a large but finite time horizon. 


4 A Toy Model: The Black Scholes Case 


We illustrate the results of the previous section in a toy example, namely the Black- 
Scholes model, with one stock of price process 


d$, = S,(bd + odW,), t= 0. 


We also consider an agent with constant proportion portfolio strategies. In other 
words, the set of admissible controls A is equal to R. Given a constant proportion т 
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€ R invested in the stock, and starting w.l.o.g. with unit capital, the average growth 
rate portfolio of the agent is equal to 


" Lt 2:2 Ww 
Lp = -T = (br — ) +от T 


It follows that L? is distributed according to a Gaussian law: 
gia oln? 
= on 
К) 
un Me EE 


and its (limiting) Log-Laplace function is equal to 


г@, л) = (im оа] = ofr- a — 077-] 


e Upside chance probability. 


The dual control problem in the upside case is then given by 


-——— E оо, зл, 
HO АК = 2,193, it0x6«1, 
with 
b 
"C 
шу шее; 


Hence, Л 4+ differentiable on [0, 1) with: A‘ 40) = 2 =, and A! (1) = oo, i.e. Ay is 
steep. From Theorem 3.1, the value оар ofthe upside large deviations probability 
is explicitly computed as: 


1 
v4 (£) := sup lim SUP z — In PILE > > £] 
"cR T—oo 


abso 


0, if £« A,(0) = 
—(J A4) — V2), if £ > Л, 0) 


with an optimal strategy: 


2, if £< AQ) 


J 5, if £> А, (0). 
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Notice that, when £ < A^ (0), we have not only a nearly opum control as stated 
in Theorem 3.1, but an optimal control given by т? = b/c?, which is precisely the 
optimal portfolio for the classical Merton problem with logarithm ge function. 


Indeed, in this model, we have by the law of large numbers: LT > 2, = A’, (0), as 


T goes to infinity, and so lim7— oo T In PILZ > {| =0 = v4 (8). o when 
£ > A’, (0), the optimal strategy depends on £, and the larger the target growth rate 
level, the more one has to invest in the stock. 


e Downside risk probability. 


The dual control problem in the downside case is then given by 


pg 
A-(0) = in TO, T = T(0.8() = 775 1—5. #<0, 
«eR 0 
with 
b 

(0) = ————. 

"= 20-9 
Hence, A.. is differentiable on R- with: А” (—o0) = 0, and Л! (0) = Z5. From 


Theorem 3.1, the value function of the downside large deviations probability 18 
explicitly computed as: 

v_(€) :— inf lim inf + mP[E7 < t] 

i “eR Too T pe 


- inf [А— (0) — 6£] 


if #<0 
Erud -4ty, осил 0) = 4 


with an optimal strategy: 


-dg 26 . А 
т” =,/—, if O< €< АЛ. (0) 
Oo 


Moreover, when £ < 0, and by choosing т^ = О, we have LE — 0, so that 
PIER < £] = 0, and thus v. (£) = —oo. In other words, when the target growth 
rate Ё < 0, by doing nothing, we have an optimal strategy for v. (£). 


Remark 4.1 The above direct calculations rely on the fact that we restrict portfolio 
т to be constant in proportion. Actually, the explicit forms of the value function and 
optimal strategy remain the same if we allow a priori portfolio strategies 7 € A to 
change over time based on the available information, i.e. to be F-predictable. This 
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requires more advanced tools from stochastic control and PDEs to be presented in 
the sequel in a more general framework. L 


5 Factor Model 


We consider a market model with one riskless asset price S? = 1, and d stocks of 
price process S governed by 


dS; = diag(S;) (b(Y;)dt + a (Y;)dW;) 
dY, = n(¥;)dt + ?(Y)dW;, 


where Y is a factor process valued in IR", and W is a d + m dimensional stan- 
dard Brownian motion. The coefficients b, с, 7, у are assumed to satisfy regular 
conditions ensuring existence of a unique strong solution to the above stochastic 
differential equation, and ø is also of full rank, i.e. the d x d-matrix go’ is invertible. 

A portfolio strategy 7 is an R¢-valued adapted process, representing the fraction 
of wealth invested in the d stocks. The admissibility condition for т in A will be 
precised later, but for the moment 7 is required to satisfy the integrability conditions: 


T T 
n In; b(Y;)|dt +f |т!о(ү,)241 < oo, a.s. forall T > 0. 
0 0 


The growth rate portfolio is then given by: 


T / ^Y, T 
Lt. - | (xb) - V ота f п'о(Ү)а№,. 


For any 0 є К, and 7, we compute the Log-Laplace function of the growth rate 
portfolio: 


Tr (0, т) := In E[e?"7] 


T т 
= In ol E( f Өтүо(Ү,)а%/, Je? lo а 
0 


where £(.) denotes the Doléans-Dade exponential, and f is the function: 


1-6 


woo’ (y)r. 


РӨ, У, 7) = T'b(y) — 


We now impose the admissibility condition that 7 lies in A if the Doléans-Dade local 
martingale e( i 6n, (0d W,. " is a true martingale for any Т > 0, which is 
<t< 


ensured, for instance, by the Novikov condition. In this case, this Doléans-Dade 
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exponential defines a probability measure Q, equivalent to Р on (©, Fr), and we 
have: 


T 
Гг (0, т) = In E| exp (o | ГӨ, Y, mat) . 
0 
where Y is governed under Ол by 
dY, — (nY) T 0Y Do (Ү,)т,)аї + yY dWr. 


with W” a Q,,-Brownian motion from Girsanov's theorem. 
We then consider the dual control problems: 


e Upside chance: for 0 > 0, 


T 
A4 (0) = sup lim sup B In 1^ [ exp ef f0, Y;, m)dt)]. 
0 


тєА T—oo 


e Downside risk: for 0 < 0, 


T 
A-(0) = inf mint 2 th rt| exp ef tO, у, ndr) . 
T 0 


TEA T—oo 


These problems are known in the literature as ergodic risk-sensitive control problems, 
and studied by dynamic programming methods in [1, 5, 14]. Let us now formally 
derive the ergodic equations associated to these risk-sensitive control problems. We 
consider the finite horizon risk-sensitive stochastic control problems: 


Т 
и+(Т, у; 0) = sup 9 [ exp ef / (0, Yi, т,)ат) |Ү = y|, 0> 0 
TEA 0 


T 
u (T, y; 0) = inf 2O-[ exp of f(0, Y,, 7,)dt)|Yo = y|, 0 <0, 
TEA 0 
and by using the formal substitution: 


Inui(T, y; 0) = A«(0)T + ф+(у; 0), for large T, 


in the corresponding Hamilton-Jacobi-Bellman (HJB) equations for u+: 


ди+ " Й 1 / 
Sp = sup [0/0 у, Dus + (10) + 070)0'0)т) Dyus + tro у) Djs]. 
TER 


we obtain the ergodic HJB equation for the pair (Л + (0), p+(., 0)) as: 
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/ 1 / 2 1 / 2 
Г(®) = n Dye + zt 0) руе) + = |7 0) Dye 


+ @ sup [T OO) + 0) О) Рур) - тоо (у), 


тєв 


which is well-defined for 0 < 1. In the above equation Г (0) is a candidate for Л + (0) 
while o is a candidate solution for p+. This can be rewritten as а semi-linear ergodic 
PDE with quadratic growth in the gradient: 


1 
Г(0) = (n(y) + qe (0) bQ)).Dyg + str) Dj) 


1—0 


1 / 0 / /—1 / 
*t5Dye то) [а + GF (09) cp 00D,e 


0 / —l 
+ 5(Т— 8)” (co) b(y), (5.1) 


and a candidate for optimal feedback control of the dual problem: 


1 
$56) = OANT ODO) + 07) Dye 0)]. (5.2) 


1 


We now face the questions: 


e Existence of a pair solution (T (0), y(., 0)) to the ergodic PDE (5.1)? 
e Do we have Г(0) = ^4 (0), and what is the domain of Г? 


We give some assumptions, which allows us to answer the above issues. 


(H1) РЬ, о, папа y are smooth C? and globally Lipschitz. 
(H2) ос'(у) and уту! (y) are uniformly elliptic: there exist 1, бо > O s.t. 


&oo'(y£ < 8212, VE, y e R”, 
EY Q)€ < HIE, VE, у e R". 


ô1lél? 
бё]? 


= 
= 


(H3) There exist cı > 0 and c? > 0 s.t. 
bloo bz cily? — с, Yy €R". 
(H4) Stability condition: there exist сз > О and c4 > 0 s.t. 
(n(y) — v6 (2^) 1 BO)).y < —eslyP + ea 


According to [1] (see also [15, 20]), the next result states the existence of a smooth 
solution to the ergodic equation. 


Proposition 5.1 Under (H1)-(H4), there exists for any Ө < 1, a solution 
(T (0), p; 0)) with q(.; 0) C?, to the ergodic HJB equation s.t: 
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e For 0 < 0, 0(.; 0) is upper-bounded 
p(y; 6) —  —oo, as |y| > oo, 
e For 0 € (0, 1), Ф(.; 0) is lower-bounded 
(y; 6) — oo, as |у| ^ оо, 
and 
[Dyes O| < Co + ly). 


We now relate a solution to the ergodic equation to the dual risk-sensitive con- 
trol problem. In other words, this means the convergence of the finite horizon risk- 
sensitive stochastic control to the component Г of the ergodic equation. We distin- 
guish the downside and upside cases. 

e Downside risk: In this case, it is shown in [15] that for all 0 < 0, the solution 
(T (0), pC; 6) to (5.1), with v(., 0) C? and upper bounded, is unique (up to an additive 
constant for (.; 0)), and we have: 


Г(60) = A-(0), 0 «0. 


Moreover, there is an admissible optimal feedback control 7(., 0) for A_(@) given 
by (5.2), and for which the factor process Y is ergodic under Q+. It is also proved 
in [15] that Г = A_ is differentiable on (—оо, 0). Therefore, from Theorem 3.2, the 
solution to the downside risk large deviations probability is given by: 


v_(é) = inf [F(6) - 60], £< r'(0), 
0x0 
with an optimal control: 


m" = (00), TOW) = £, V£ € T(=), Г'(0)), 


while v. (£) = —oo for £ < l''(—oo). 

e Upside chance: In this case, 0 < 0 < 1, there is no unique solution 
(I (0), (.; 0)) to the ergodic equation, with y(.; 0) C? lower-bounded, even up 
to an additive constant, as pointed out in [6]. In general, we only have a verification 
type result, which states that if the process Y is ergodic under Q4, then 


Гө) = А+(@), 
and 7(., 0) is an optimal feedback control for Л 4 (0). 


In the next paragraph, we consider a linear factor model for which explicit calcu- 
lations can be derived. 
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5.1 Linear Gaussian Factor Model 


We consider the linear factor model: 


dS, = diag(S;)((B1Y; + Bo)dt + сам) іп R^, 
dY; = KY,dt + yd W;, in R”, 


with K a stable matrix іп R”, B, a constant d x m matrix, Во a non-zero vector in 
R’, o a d x (d + m)-matrix of rank d, and ^y a nonzero т x (d + т) matrix. We 
are searching for a candidate solution to the ergodic equation (5.1) in the quadratic 
form: 


1 
(y; 0) = 5С00)у-у + р(0)у, yeR", 


for some m х m matrices C (0) and 2 (0). Plugging this form of ọ into (5.1), we find 
that C(@) must solve the algebraic Riccati equation: 


1 0 
5€ OY (len vs 


7 o'(co') la)y C(0) 


0 1 ө 
fen 7710 (0)! Bi) C)  ; 1—5Bi 007) Bi = 0, (53) 


while B(0) is determined by 


0 0 ; 
(к + ym g^ 690 B + (atm + = 57 (2) e) C()) D(0) 


0 
z (0V € + В1) Go) Bo = 0. 


T 1 
Then, T (0) is given by: 


c' (ca^) ‘ayy D(0) 


1, Bo 6 
ГО) = (Үү CO) + 500077004 + ——3 


0 _ 1 0 = 
+ 12980000) toy DO) + = т. Во(00/) "Bo, 


and a candidate for the optimal feedback control is: 


f(y; 0) = 


1 
rs 0900 [081 + oy CO)y + Bo + ey DO). 


In [6], it is shown that there exists some positive 0 small enough, s.t. for 0 < 0, there 
exists a solution C(0) to the Riccati equation (5.3) s.t. Y is ergodic under Q;, and 
so by verification theorem, I'(0) = A+ (0). In the one-dimensional asset and factor 
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model, as studied in [17], we obtain more precise results. Indeed, in this case: d — 
m = 1, the Riccati equation is a second-order polynomial equation in C(0), which 
admits two explicits roots given by: 


к [1-6 — p34) x Va —6)0 — 88) 
I? 1—0(1— 7) ' 


С+(0) = 


for all 0 < 0, with 


miy 


= 1 
б 3^V 6 o 1-0 àp- Kjøl 


B 


where |y| (resp. |a|) is the Euclidian norm of y (resp. с), and p € [—1, 1] is the 


> 0, 


correlation between S and Y, i.e. p = M. Actually, only the solution C(0) — 
С_ (0) is relevant in the sense that for this root, Y is ergodic under Ох, and thus by 
verification theorem: 


1 1 0 
Mdb a NACE d 


+; á MDO +» $ Hy. geg 
ish i-i 05 


where 


Во 9(ply1C- (9) + iat) 
K\o| V0 -00 — 08) ' 


and with optimal control for A+ (0) given by: 


Dey = 


^. . — 1 — 
80:9 = goga lla nO) + E + omo], 


Moreover, it is also proved in [17], that 


В В? || 


rus — 
(0) 2102 40|2К 


(recall that К « 0) and the function Г is steep, i.e. 


lim Г'(0) = 
040 
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From Theorems 3.1 and 3.2, the solutions to the upside chance and downside risk 
large deviations probability are given by: 


v4(£) = inf. [г00) — 0t], LER, 
0<0<0 
v_(é) = inf [T(0) – 0], £< Г"), 
0x0 
with optimal control and nearly optimal control for v4 (£): 


x = #(Y,; 0(0), TOO) = £, when £ > Г'(0), 


lions 
ai” = #(¥;; On), with 6, = 6(T(0) + -) —5 0,  when£ < Г'(0), 
n 
and optimal control for v— (£): 


m; =â Ө(#)), TOO) = & Yee (Г'(—оо), (0). 


5.2 Examples 


o Black-Scholes model. This corresponds to the case where B, = О. Then, 
B —0-—1,C.(0) = D(0) = 0, and so 
10 B 
A40) = T(0) = 2————, уӨб<1. 
„@ = ГӨ = e ##< 


We thus obtain the same optimal strategy as described in Sect. 4. 


e Platen-Rebolledo model. In this model, the logarithm of the stock price S is 
governed by an Ornstein-Uhlenbeck process Y, and this corresponds to the case 
where В| = К <0, Во = |? > 0, y = c, and thus p = 1. Then, 8 = 0, 0 = 1, 


K 1 
С_(0) = (ЧҮШ тє | D(0) = —-0, 
lc? 2 
and so 
K 2 
re = p Үл б] +67, g<1, 
_ К 2 2 
Г'(0) = £ := а Ie Г'(-оо) = £ := Ier 
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oe =1- (4), мее 


The solution to the upside chance large deviations probability is then given by: 


Е y itx 
v+ (£) = +9 - 
0,  if£«£ 


with optimal (resp. nearly optimal) portfolio strategy: 


К —4(£—£ 1 = 
np = ( 5, if £2 


lc]? 
К — 1/п 1 = 
+(n) ; 
rr— yY, —- f £t. 
t lel? +» 1 = 


The solution to the downside risk large deviations probability is given by: 


(0—02. 7 
о(6) = geb if #<{8{<{ 
оо, if б<, 


with optimal portolio strategy: 


Е 400 — £) 1 

£ ne 

ICI = рш. 
í |a|? dii 


References 


1. Bensoussan, A., Frehse, J.: On Bellman equations of ergodic control іп RY. J. Reine Angew. 
Math. 429, 125-160 (1992) 

2. Bielecki, T.R., Pliska, S.R.: Risk-sensitive dynamic asset management. Appl. Math. Optim. 
39, 337—360 (1999) 

3. Davis, M., Lleo, S.: Risk-sensitive benchmarked asset management. Quant. Financ. 8, 415—426 
(2008) 

4. Dembo, A., Zeitouni, O.: Large Deviations Techniques and Applications, 2nd edn. Springer, 
New York (1998) 

5. Fleming, W., McEneaney, W.: Risk sensitive control on an infinite horizon. SIAM J. Control 
Optim. 33, 1881—1915 (1995) 

6. Fleming, W., Sheu, S.J.: Risk sensitive control and an optimal investment model. Math. Financ. 
10, 197—213 (2000) 

7. Guasoni, P., Robertson, S.: Portfolios and risk premia for the long run. Ann. Appl. Probab. 22, 
239—284 (2012) 

8. Hata, H., Nagai, H., Sheu, S.J.: Asymptotics of the probability minimizing a down-side risk. 
Ann. Appl. Probab. 20, 52—89 (2010) 


528 H. Pham 


9. Hata, H., Sekine, J.: Solving long term investment problems with Cox-Ingersoll-Ross interest 

rates. Adv. Math. Econ. 8, 231—255 (2005) 

10. Karatzas, I., Shreve, S.: Methods of Mathematical Finance. Springer, New York (1998) 

11. Korn, R.: Optimal Portfolios: Stochastic Models for Optimal Investment and Risk Management 
in Continuous-time. World Scientific, Singapore (1997) 

12. Liu, R., Muhle-Karbe, J.: Portfolio choice with stochastic investment opportunities: a user's 
guide, Preprint (2013) 

13. Merton, R.: Optimum consumption and portfolio rules in a continuous-time model. J. Econ. 
Theory 3, 373-413 (1971) 

14. Nagai, H.: Bellman equations of risk sensitive control. SIAM J. Control Optim. 34, 74-101 
(1996) 

15. Nagai, H.: Downside risk minimization via a large deviation approach. Ann. Appl. Probab. 22, 
608—669 (2012) 

16. Nagai, H., Peng, S.: Risk-sensitive dynamic portfolio optimization with partial information on 
infinite time horizon. Ann. Appl. Probab 12, 173-195 (2002) 

17. Pham, H.: A large deviations approach to optimal long term investment. Financ. Stoch. 7, 
169-195 (2003) 

18. Pham, H.: Some applications and methods of large deviations in finance and insurance. Paris- 
Princeton Lectures on Mathematical Finance, Lecture Notes in Mathematics, vol. 1919 (2007) 

19. Pham, H.: Continuous Time Stochastic Control and Optimization with Financial Applications. 
SMAP. Springer, New York (2009) 

20. Robertson, S., Xing, H.: Large time behavior of solutions to semi-linear equations with 
quadratic growth in the gradient. SIAM J. Control Optim. 53(1), 185-212 (2015) 

21. Stettner, L.: Duality and risk-sensitive portfolio optimization. In: Yin, G., Zhang, Q. (eds.) 
Mathematics of Finance, Contemporary Mathematics, vol. 351, pp. 333-347 (2004) 

22. Stutzer, M.: Portfolio choice with endogenous utility: a large deviations approach. J. Econom. 
116, 365-386 (2003) 


Systemic Risk and Default Clustering 
for Large Financial Systems 


Konstantinos Spiliopoulos 


Abstract As it is known in the finance risk and macroeconomics literature, 
risk-sharing in large portfolios may increase the probability of creation of default 
clusters and of systemic risk. We review recent developments on mathematical and 
computational tools for the quantification of such phenomena. Limiting analysis such 
as law of large numbers and central limit theorems allow to approximate the distri- 
bution in large systems and study quantities such as the loss distribution in large 
portfolios. Large deviations analysis allow us to study the tail of the loss distrib- 
ution and to identify pathways to default clustering. Sensitivity analysis allows to 
understand the most likely ways in which different effects, such as contagion and 
systematic risks, combine to lead to large default rates. Such results could give useful 
insights into how to optimally safeguard against such events. 


Keywords Systemic risk - Default clustering * Large portfolios * Loss distribution - 
Asymptotic methods - Rare events 


1 Introduction 


The past several years have made clear the need to better understand the behaviour in 
large interconnected financial systems. Almost all areas of modern life are touched 
by a financial crisis. The recent financial crisis of 2007—2009 brought into focus the 
networked structure of the financial world. It challenged the mathematical finance 
community to understand connectedness in financial systems. The understanding of 
systemic risk, i.e., the risk that a large numbers of components of an interconnected 
financial system fails within a short time leading to the failure of the system itself, 
becomes an important issue to investigate. 

Interconnections often make a system robust, but they can also act as conduits 
for risk. Even things that may seemingly be unrelated, may become related as risk 
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restrictions, may for example, force a sale of one type of a well-performing asset to 
compensate for the poor behavior of another asset. Thus, appropriate mathematical 
models need to be developed, in order to help in the understanding of how risk can 
propagate between financial objects. 

Itis possible that initial shocks could trigger contagion effects (e.g., [1]). Examples 
of such shocks include: changes in interest rate values, in currencies values, changes 
of commodities prices, or reduction in global economic growth. Then, there may bea 
transmission mechanism which causes other institutions in the system to be affected 
by the initial shock. An example of such a mechanism is financial linkages among 
economies. Another reason could simply be investor irrationality. In either case, 
systemic risk causes the perceived risk-return trade-off in the economy to change. 
Uncertainty becomes an issue and market participants fear subsequent losses in 
asset prices with a large dispersion in regards to the magnitude of the crisis. Reduce- 
form point process models of correlated default are many times used (a): to assess 
portfolio credit risk and (b): to value securities exposed to correlated default risk. 
The workhorses of these models are counting processes. In this work we focus on 
using dynamic portfolio credit risk models to study large portfolio asymptotics and 
default clustering. 

Large portfolio asymptotic were first studied in [2]. The model in [2] is a sta- 
tic model of a homogeneous pool and firms default independently of one another 
conditional on a normally distributed random variable representing a systematic risk 
factor. Alternative distributions of the systematic factor were examined in [3, 4] and 
the case of heterogeneous portfolios was studied in [5]. In [6], the authors extend the 
model of [2] dynamically and the systematic risk factor follows a Brownian motion. 
In [6], the authors study a structural model for distance to default process in a pool 
of names. À firm defaults when the default process hits zero. Exploiting conditional 
independence of defaults, [7, 8] have studied the tail of the loss distribution in the 
static case. Large deviations arguments were also used in [9] to study stochastic 
recovery effects on large static pools of credit assets. 

Reduced-form models of correlated default timing have appeared in the finance 
literature under different forms. Giesecke and Weber [10] take the intensity of a 
name as a function of the state of the names in a specified neighborhood of that 
name. The authors in (11, 12] take the intensity to be a function of the portfolio 
loss and each name can be either in a good or in a distressed financial state. These 
papers prove law of large numbers for the portfolio loss distribution and develop 
Gaussian approximations to the portfolio loss distribution based on central limit 
theorems. Cvitanié et al. [13] consider the typical behavior of a mean field system 
with permanent default impact. 

Sircar and Zariphopoulou [14] study large portfolio asymptotics for utility indif- 
ference valuation of securities exposed to the losses in the pool. In [15], the authors 
study systematic risk via a mean field model of interacting agents. Using a model 
of a two well potential, agents can move freely from a healthy state to a failed state. 
The authors study probabilities of transition from the healthy to the failed state using 
large deviations ideas. In [16] the authors propose and study a model for inter-bank 
lending and study its stochastic stability. 
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The authors in [17] employ jump-diffusion models driven by Hawkes processes 
to empirically study default clustering and the time dimension of systemic risk. 
Duan [18] proposes a hierarchical model with individual shocks and group specific 
shocks. The work of [19] reviews intensity models that are governed by exogenous 
and endogenous Markov Chains. In [20], the authors proposed a dynamic point 
process model of correlated default timing in a portfolio of firms (“names”). The 
model incorporates different sources of default clustering identified in recent empir- 
ical research, including idiosyncratic risks, exposure to systematic risk factors and 
contagion in financial markets, see [21, 22]. Based on the weak convergence ideas of 
[20], the authors in [23] obtain and study formulas for the bilateral counterparty val- 
uation adjustment of a credit default swaps portfolio referencing an asymptotically 
large number of entities. 

The model in [20] can be naturally understood as an interacting particle system 
that is influenced by an exogenous source of randomness. There is a central source of 
interconnections and failure of any of the components stresses the central ‘bus’, which 
in turn can cause the failure of other components (a contagion effect). Computing 
the distribution of the loss from default in such models tends to be a difficult task 
and while Monte-Carlo simulation methods are broadly applicable, they can be slow 
for large portfolios or large time horizons as it is commonly the interest in practice. 
Mathematical and computational tools for the approximation to the distribution of 
the loss from default in large heterogeneous portfolios were then developed in [24], 
Gaussian correction theory was developed in [25] and analysis of tail events and 
most likely paths to failure via the lens of large deviations theory was then developed 
in [26]. We remark here that to a large extend systemic risk refers to the tail of the 
distribution. The authors in [27] combine the large pool asymptotic results of [1, 
3, 4, 9, 10, 24—34] with maximum likelihood ideas to construct tractable statistical 
inference procedures for parameter estimation in large financial systems. 

Such mathematical results lead to new computational tools for the measurement 
and prediction of risk in high-dimensional financial networks. These tools mainly 
include approximations of the distribution of losses from defaults and of portfolio 
risk measures, and efficient computational tools for the analysis of extreme default 
events. The mathematical results also yield important insights into the behavior of 
systemic risk as a function of the characteristics of the names in the system, and in 
particular their interaction. 

Financial institutions (banks, pension funds, etc.) often hold large portfolios in 
order to diversify away a number of idiosyncratic effects of individual assets. Deposit 
insurance premia depend upon meaningful models and assessment of the macroeco- 
nomic effect of the various phenomena that drive defaults. Development of related 
mathematical and computational tools can help inform the design of regulatory pol- 
icy, improve the pricing of federal deposit insurance, and lead to more accurate risk 
measurement at financial institutions. 

In this paper, we focus on dynamic default timing models for large financial 
systems that fall into the category of intensity models in portfolio credit risk. Based 
on the default timing model developed in [20], we address several of the issues just 
mentioned and that are typically of interest. The mathematical and computational 
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tools developed allow to reach to financial related conclusions for the behavior of 
such large financial systems. 

Although the primary interest of this work is risk in financial systems, models of 
the type discussed in this paper are generic enough to allow for modifications that 
make them relevant in other domains, including systems reliability, insurance and 
epidemiology. In reliability, a large system of interacting components might have 
a central connection, and be influenced by an external environment (temperature, 
for example). The failure of an individual component (which could be governed by 
an intensity model appropriate for the particular application) increases the stress 
on the central connection and thus the other components, making the entire system 
more likely to fail. In insurance, the system could represent a pool of insurance 
policies. The effect of wildfires might, in that example, be modelled by a contagion 
term. Systematic risk in the form of environmental conditions has an impact on the 
whole pool. 

The rest of the article is structured as follows. In Sect. 2 we describe the correlated 
default timing proposed in [20]. Section3 studies the typical behavior of the loss 
distribution in such portfolios as the number of names (agents) in the pool grow to 
infinity. Section 4 focuses on developing the Gaussian correction theory. As we shall 
see there, Gaussian corrections are very useful because they make the approximations 
accurate even for portfolios of relatively small sizes. In Sect.5, we study the tail of 
the loss distribution using arguments from the large deviations theory. We also study 
the most likely path to systemic failure and to the development of default clusters. 
An understanding of the preferred paths to large default rates and the most likely 
path to the creation of default clusters can give useful insights into how to optimally 
safeguard against such events. Importance sampling techniques can then be used to 
construct asymptotically efficient estimators for tail event probabilities, see Sect. 6. 
Conclusions are in Sect.7. A large part of the material presented in this work, but 
not all, is related to recent work of the author described in [20, 24—26]. 


2 A Dynamic Correlated Default Timing Model 


One of the issues of fundamental importance in financial markets is systemic risk, 
which may be understood as the likelihood of failure of a substantial fraction of firms 
in the economy. There are a number of ways of interpreting this, but our focus will 
be the behavior of actual defaults. Defaults are discrete events, so one can frame the 
interest within the language of point processes. Empirically, defaults tend to happen 
in groups; feedback and exposure to market forces (along the lines of "regimes") 
tend to produce correlation among defaults. 

Let us fix a probability space (©, .2, Р) where all random variables will be 
defined. Denote by t” the stopping time at which the nth component (or particle) in 
our system fails. Then, as 6 N 0, a failure time т” has intensity process A", which 
satisfies 

P{t" €(t,t+6]|F;,, v" > t) m avs, (1) 
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where F, is the sigma-algebra generated by the entire system up to time t. Hence, we 
essentially have that the process defined by 1172 <,;, — d A le ss)ds is a martingale. 

Motivated by the empirical studies in [21, 22], we may model the intensity A" in 
such a way that it depends on three factors: a mean reverting idiosyncratic source 
of risk, the portfolio loss rate and a systematic risk factor. Heterogeneity can be 
addressed by allowing the intensity parameters of each name to be different. The 
mean reverting character of the idiosyncratic source of risk is there to guarantee that 
the effect of a default in the pool has a transient effect on the default intensities of the 
surviving names. The dependence on the portfolio loss rate, denoted by L is the 
term that is responsible for the contagious effects, whereas the systematic risk factor, 
denoted by X. is an exogenous source of risk. To be precise, the default intensi- 
ties, A"'s, are governed by the following interacting system of stochastic differential 
equations (SDEs) 


аА = —оһ@Л — Аһ)? + олау! + BCALN + cpSrPdX,, А = А". (2) 


where, (W"],ew be a countable collection of independent standard Brownian 
motions. 
The process LN represents the empirical failure rate in the system, i.e., 


1 N 
LY = uen (3) 


n=1 


where by letting {en}nen to be an i.i.d. collection of standard exponential random 
variables we have 


t 
tint {r= 0: f ids =en). (4) 
AY 


=0 


The process X; represents the systematic risk, which can be modeled to be the 
solution to some SDE 


dX, = bo(X;)dt + o9(Xi)dV,, Хо = xo. (5) 


where V is a standard Brownian motion which is independent of the W”’s and е„”$. 
Plausible models for X; could be an Ornstein-Uhlenbeck process or a Cox-Ingersoll- 
Ross (CIR) process. 

In the case pe = 8? = 0 for all n € {1,..., №}, one recovers the classi- 
cal CIR process model in credit risk, e.g., [35]. Namely, the intensity SDE (1) 
extends the widely-used CIR process by including two additional terms that gen- 
erate correlation between failure times. The term £85 47d X; induces correlated dif- 
fusive movements of the component intensities; the process X represents the state 
of the macro-economy, which affects all assets in the pool. The term pe d EN intro- 
duces a feedback (contagion) effect. The standard term —о„ (À? — Àn)dt is a mean 
reverting term allowing the component to “heal” after a shock (i.e., a failure). This 
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parsimonious formulation allows us to take advantage of the wealth of knowledge 
about CIR-type processes. The parameter є > 0 allows us to later on focus on rare 
events. 

The process L" of (3), which simply gives us the fraction of components which 
have already failed by time t, affects each of the remaining components in a natural 
way. Each failure corresponds to a Dirac function in the measure dL; the term 
BE dL" thus leads to upward impulses in A"'s, which leads (via (4)) to sooner 
failure of the remaining functioning components. We might think of a central “bus” 
in a system of components. Each of the components depends on this bus, which in 
turn sensitive to failures in the various components. In the financial application that 
was considered in [20], this feedback mechanism is empirically observed to be an 
important channel for the clustering of defaults in the U.S. (see [21]). 

In order to allow for heterogeneity, the parameters in (2) depend on the index n. 
Define the “type” 


p? = (AF, өл, Ал, On, BE, бз) (9 


for each n є N and > 0. The р” take value in P = R$ x Rx R, x RC RÓ. The 
parameters (A2, Qn, А Ons BE, B5) are assumed to be bounded uniformly in n € N. 

We can capture the heterogeneity of the system by defining Uy — x > 9p, 
and assuming that this empirical type frequency has a (weak) limit. In particular we 
make the following assumption 


Assumption 2.1 We assume that U = іту Uy exists (in A(P)). 


Proposition 3.3 in [20] guarantees that under the assumption of an existence of a 
unique strong solution for the SDE for X. process, the system (2)-(5) has a unique 
strong solution such that А” > 0 for every N € N, n є (1,..., N} and t > 0. The 
model (2)-(5) is a mean-field type model; the feedback occurs through the empirical 
average of the pool of names. It is somewhat similar to certain genetic models (most 
notably the Fleming- Viot process; see [36], [37, Chap. 10], and [38]). However, as it 
is also demonstrated in [20, 24], the structure of the system (2)-(5) presents several 
difficulties that bring the analysis of such systems outside the scope of the standard 
setup. 


3 Typical Behavior: Law of Large Numbers 


The system (2)-(5) can naturally be understood as an interacting particle system. This 
suggests how to understand its large-scale behavior. The structure of the feedback (the 
empirical average L) is of mean-field type (roughly within the class of McKean- 
Vlasov models; see [31, 39]). An understanding of "typical" behavior of a system 
as N — oo is fundamental in identifying "atypical" or "rare" events. 
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To formulate the law of large numbers result, we define the empirical distribution 
of the p"'s corresponding to the names that have survived up to time ft, as follows: 


1 N 
N 
M, = N > дру 1{тл>г}. 
п=1 


This captures the entire dynamics of the model (including the effect of the hetero- 
geneities). We can directly calculate the failure rate from the ш °: 


LN =1- ud (P), t20. (7) 
Let us then identify the limit of ШУ (P) as N — oo. This is a law of large numbers 


(LLN) result and it identifies the baseline "typical" behavior of the system. For 
f € C?(P), let 


EN 8f -ôf 
(Li J) = 507-5 (р) — a — А) (p) — Af (p) 
EP 
LNP = 8° (p) 
д zs д2 
(L3 f)(p) = ep abo x) 2 (p + = 0530400 7 p) (8) 


д 
CINP = +89000) 2 (p and Q(p)=A 


for p = (A, a, А, c, ВС, 85). The generator L1 corresponds to the diffusive part of 
the intensity with killing rate A, and £2 is the macroscopic effect of contagion on 
the surviving intensities at any given time. The operators Z3 and £4 capture the 
dynamics due to the exogenous systematic risk X. Then u™ tends in distribution (in 
the natural topology of subprobability measures on 7?) to a measure-valued process 
L. Letting 


(fu) = f урушар) 
рє? 


for all f € C?(P), the limit 1 satisfies the stochastic evolution equation 


4 (fs ho) = [C1 f, Be) + (Q, fia) of. fir) + (C5" f is) dt e (E f D) dVi аз. 
(9) 


With sufficient regularity, this is equivalent to the stochastic integro-partial differen- 
tial equation (SIPDE) 


du = £iudt + (/ ov) Гой + Lx dt + ely *vdV, a.s. (10) 
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where * denotes adjoint in the appropriate sense (for notational simplicity, we have 
written (10) to include the types as one of the coordinates; in a heterogeneous col- 
lection in practice we would often use only A in solving (10)). We recall the rigorous 
statement in Theorem 3.1. 

The SIPDE (10) gives us a “large system approximation" of the failure rate: 


LY s1i-B(P)z1 - f va, p)dp. (11) 


The computation of the first-order approximation (11) suggested by Ше LLN requires 
solving the SIPDE (10) governing the density of the limiting measure. In [24] a 
numerical method for this purpose is proposed. The method is based on an infinite 
system of SDE's for certain moments of the limiting measure. These SDEs are 
driven by the systematic risk process X and a truncated system can be solved using 
a discretization or random ODE scheme. The solution to the SDE system leads to 
the solution to the SIPDE via an inverse moment problem. 

The approximation (11) has significant computational advantages over a naive 
Monte Carlo simulation of the high-dimensional original stochastic system (2)-(5) 
and its accuracy is demonstrated in the left of Fig. 1 fora specific choice of parameters. 
It also provides information about catastrophic failure. 

The tail represents extreme default scenarios, and these are at the center of risk 
measurement and management applications in practice. The analysis of the limiting 
distribution generates important insights into the behavior of the tails as a function of 
the characteristics of the system (2)-(5). For example, we see that the tail is heavily 
influenced by the sensitivity of a name to the variations of the systematic risk X. The 
bigger the sensitivity the fatter the tail, and the larger the likelihood of large losses 
in the system (see the right of Fig. 1). Insights of this type can help understand the 
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Fig.1 Onthe/eft Comparison of distributions of failure rate LN for different N att = 1. Parameter 
choices: (о, a, А, Хо, ВС, 8°) = (0.9, 4, 0.2, 0.2, 4, 8). On the right Comparison of distribution of 
limiting failure rate 1 — ji; (7) for different values of the systematic risk sensitivity B at t = 1. 
Parameter choices: (о, a, A, А0, Bo) = (0.9, 4, 0.2, 0.2, 2) 
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role of contagion and systematic risk, and how they interact to produce atypically 
large failure rates. This, in turn, leads to ways to minimize or “manage” catastrophic 
failures. 

Let us next present the statement of the mathematical result. We denote by E the 
collection of sub-probability measures (1.е., defective probability measures) on P; 
i.e., E consists of those Borel measures v on 7 such that v(P) < 1. 


Theorem 3.1 (Theorem 3.1 in [24]) We have that u™ converges in distribution to 
р. in Dg[O, T]. The evolution of ji. is given by the measure evolution equation 


d (f, Rog = Us f йк + (О, йк of Bi + (LX f. no), ] at 
(С fiu), av. YS € С) аз. 


Suppose there is a solution of the nonlinear SPDE 


ava p | ivt. p e vs p ( „Оов, рар) ive. p d 


(12) 


p'e 


+ £7", p)dV;, t>0, pe? 


where C? denote adjoint operators, with initial condition 
li t, p)dp = U(dp). 
A u(t, p)dp (dp) 
Then 


We close this section, by briefly describing the method of moments that leads to 
the numerical computation of the loss from default. We focus our discussion on the 
homogeneous case and we refer the reader to [24] for the general case. 

Firstly, we remark that the SPDE (12) can be supplied with appropriate boundary 
conditions, which as it is mentioned in [24], are 


v(t, à = 0) = v(t, A = oo) = 0. 


Secondly, it turns out that for k є N, the moments ид (t) = b Mu(t, Ada exist 
almost surely. By (11) is clear that we want to compute uo(t). In particular, note that 
the limiting loss L; = 1 — uo(t). 
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By an integration by parts and using the boundary conditions at A = 0 and at 
à = оо, ме can prove that they follow the following system of stochastic differential 
equations 


duy(t) = {ик(@)( — ak + B bo(X;)k + 0.5(85? og (Xi)k(k — 1)) 
+ uci (t)(0.50°k(k — 1) + «АК + BC Kus (7) — uii ())dt 
+ B5oo(X;)kuy(t)dV;, (13) 


ug (0) = n B AFA (dA, 
0 


where Ao(A) = limyoo у У 5yn(A). 

The system (13) is a non-closed system since to determine ик (t), one needs to 
know ux+1(t). So, in practice one must perform a truncation at some level k = К 
where we let ux+1 = ик (that is, we use the first К + 1 moments). As itis shown in 
[24] one needs relatively small numbers of moments in order to compute the zero-th 
moment ио (7) with good accuracy. Then, by solving backwards, one computes uo(t) 
and from this one gets the limiting loss distribution 


Lı = 1 — uo(t). 


4 Central Limit Theorem Correction 


The asymptotics of (10) give via (11) the limiting behavior of the system as the 
number of components becomes large. Starting with that result, the results in [25] 
develop Gaussian fluctuation theory analogous to the central limit theory (see for 
example [11, 12, 32, 40] for some related literature). This result provides the leading 
order asymptotics correction to the law of large numbers approximation developed 
in Sect. 3. In practical terms, the usefulness of such of a result is twofold: (a) the 
approximation is accurate even for portfolios of moderate size, see [25], and (b) 
one can make use of the approximation to develop tractable statistical inference 
procedures for the statistical calibration of such models, see [27]. 
To be more precise, let us define the signed measure 


aN = VN {ul - i]: 


as N — oo. Conditional on the exogenous systematic risk process X, a central limit 
theorem applies and E = іту. E" exists in an appropriate space of distributions 
and is Gaussian. Unconditionally, it may not be Gaussian but is of mean zero (since 


we have removed the bias д from ш). 
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The usefulness of the fluctuation analysis is that it leads to a second-order approx- 
imation to the distribution of the portfolio loss L^ in large pools. The fluctuations 
analysis yields an approximation which improves the first-order approximation (11) 
suggested by the LLN, especially for smaller system sizes N. 

In particular, Theorem 4.1 implies that 


P(VN(L} — 1) > 0) ~ P(& (P) < -0 
for large N. This motivates the approximation 


1 а 1 
N т № - 

и, = — E; du FR 

: VN : i VN 


which then implies the following second-order approximation for the portfolio loss. 


or + Mr, 


d = 
LP Т, = E, (P). (14) 


1 
JN 

The numerical computation of the second-order approximation (14) suggested 
by the fluctuation analysis is amenable to a moment method similar to that used for 
computing the first-order approximation (11). In addition to solving the LLN SIPDE, 
we would also need to solve for the fluctuation limit. This limit is governed by a 
stochastic evolution equation, which gives rise to an additional system of “fluctuation 
moments.” This system is driven by the exogenous systematic risk process X and 
the martingale M, in Theorem 4.1 that is conditionally Gaussian given X. 

Left of Fig. 2 compares the approximate loss distribution with the actual loss dis- 
tribution for specific parameter choices. It is evident from the numerical comparisons 
that the second-order approximation has increased accuracy, especially for smaller 
portfolios and in the tail of the distribution. The right of Fig.2 compares for the 95 
and 99 percent value at risk (VaR) between the actual loss, LLN approximation (11), 
and approximation (14) for a pool of N = 1,000 names. It is also evident from the 
figure that the approximation for the VaR based on (14) is much more accurate than 
the law of large numbers approximation. 

Let us close this section, with a few words on the actual mathematical result. It 
turns out that the convergence © = іту со E^ happens in an appropriate weighted 
Hilbert space, which we denote by Wi (w, p), with w and p the appropriate weight 
functions, J € N and Wy Tw, p) will be its dual. Such weighted Sobolev spaces 
were introduced in [33] and further generalized in [41] to study stochastic partial 
differential equations with unbounded coefficients. These weighted spaces turn out 
to be convenient for the present situation, see [25]. 


540 K. Spiliopoulos 


Actual Loss Distribution 
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Fig. 2 On the left Comparison of approximate and actual loss distributions of failure rate LN 
for different N at t = 0.5. Parameter choices: (o, œ, А, Ao, BS, 8°) = (0.9,4, 0.2, 0.2, 1, 1). On 
the right Comparison of approximate and actual VaR. Parameter choices: (о, a, А, Ao, BS, 8° ) = 


(0.9, 4, 0.2, 0.2, 1, 1). In both cases, X is an OU process with reversion speed 2, volatility 1, initial 
value 1 and mean 1 


In order to state the convergence result, we introduce some operators. Let p € 
P c RÓ and for f є С} (Р), define 


(Gs. PP) = Gif) (p) + EZ (p) + (Q, и) La f)(P) + (a f. и) QP) 
о е 
(sf. Ф) Ф) = a^ (р) = (A 


(LE(S, g))(p) = f (p)g(p) 
(бл fp) = f(p)a 


Then, we have the following theorem related to the fluctuations analysis. 


Theorem 4.1 (Theorem 4.1 in [25]) For J > 0 large enough and for appropriate 

weight functions (w, p), the sequence (EN, t € [0, T]] уем is relatively compact 

in Dy- w 5] [0, T]. For any f € Wo (ш, p), the limit accumulation point of EN. 
0 , 


denoted by E, is unique in йу” (ш, о) and satisfies the stochastic evolution equation 


[4 t 
(f, &) = (f. Bo) + n Ox, a, f. Es)ds + n (ef f, & av, + (у, М}, аз. 
0 0 
E (15) 
for any f € wg (w, p), where M is a distribution-valued martingale with pre- 
dictable variation process 


(f, М) = Í (LsL, P), Bs) + (Lel, P), Bs) + Ua f. BS (Q Ñs) 


—2 (£5 f, As) (C2 f, Ms) ds. 


Systemic Risk and Default Clustering for Large Financial Systems 541 


Conditional on the o -algebra V; that is generated by the V — Brownian motion, М; 
is centered Gaussian with covariance function, for f, g € wg (w, p), given by 


Cov (07, Ma). (9, Mr) 


ty At? 
Vive | = | / [CCS Cf, 9), Ls) F (L6lf, g), m 


+ (Lo fy д5) (C29, As) (Q, из) 
— (L79, Ms) L2 f, As) 


— (L7 f, As) (Log, As) |ds 


Vava | 
(16) 


It is clear that if B7 = 0 for all п, then the limiting distribution-valued martingale 
M is centered Gaussian with covariance operator given by the (now deterministic) 
term within the expectation in (16). 

The main idea for the derivation of (15) comes from the proof of the convergence 
to the solution of (9). Define 


92 
a> f - у 
о (P) — «@. — X) (p) 


(Li f)(P) = 


forp = (А, a, А, о, Bo, 8°). Let's also assume for the moment that Bs = 0 for every 
n € №, i.e., let's neglect exposure to the exogenous risk X and focus on contagion. 
Then we can write the evolution of ( ГИЧ ) as 


N 
АЖЕ үе үеге Ves dt — E У ФЦ 
N l 


m > 2 |/ (v + Br )- РФ? | Liq, <А Mtn «dt + dM; 
N (217, и; ? Jat + (Lof, D MIC up dt ^t dM, 


where M is a martingale which may change from line to line. This leads to (9), when 
B3 = 0 for every n є N, see [20]. 
To get the Gaussian correction, we see that 


ар а) (eif. SY) (сол, SY) (9 ul) + of no (Q, 80) dt an 


where M is a martingale. For large N, M should be Gaussian, in which case E" is 


indeed a Gaussian process. Putting the systematic risk process X back into (2)-(5), 
one recovers the result of Theorem 4.1. 
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5 Analysis of Tail Events: Large Deviations 


Once we have identified what is typical, we can study the structure of atypically 
large failure rates. Large deviations outlines a circle of ideas and calculations for 
understanding the origination and transformation of rare events (see [42, 43]). Large 
deviation arguments allow us to identify the “dominant” way that rare events will 
occur in complex systems. This is the feature that is being exploited in [26], i.e., how 
different sources of stochasticity can lead to system collapse. 

By the discussion in Sect.3, we have that the pool has a default rate Lr = 
1 — ur (P) at time T. Let's fix £ > Lr. Then пту oo р > £} = 0; it is a rare 
event that the default rate in the pool exceeds #. We want to understand as much as 
possible about {LM > 1}. 

Using, the theory of large deviations, we can understand both how rare this event 
is, and what the “most likely" way is for this rare event to occur. Events far from 
equilibrium crucially depend on how rare events propagate through the system. Large 
deviations gives rigorous ways to understand these effects, and we want to use this 
machinery to understand the structure of atypically large default clusters in the port- 
folio. A reference for large deviations is [44]. 

If we have that 


P(LP ~ р} © exp[—NI(g)], as № — oo 


for some appropriate functional 7, then by the contraction principle we should 
have that 
P{L} ~ £) © exp[-N1'(], as N > oo (17) 


where 
I'(£) = inf(1(g) : p(T) = 6 (18) 


(in other words, 7’ is the large deviations rate function for Lx ). This gives us the rate 
at which the tail of the default rate LX decays as the diversification parameter grows. 
More importantly, though, the variational problem (18) gives us the preferred way 
which atypically large default rates occur. Namely, if there is a фу : [0, T] — [0, 1] 
such that 


ГО) = 1007) 
then for any à > 0, the Gibbs conditioning principle suggests that 


lim РЦ — er > 8107 > 0) = 0. 
Моо 


Insights into large deviations of (2)-(5) have been developed in [26] when e |, 0 
and when = = O(1) as N Z oo. We note here that in the case = = O(1), the large 
deviations principle is conditional on the systematic risk X. Such results allow us to 
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study the comparative effect of the systematic risk process X and of the contagion 
feedback on the tails of the loss distribution. 

Before presenting the result, let us first investigate numerically a test case, which 
is indicative of the kind of results that large deviations theory can give us. Apart 
from approximating the tail of the distribution, large deviations can give quantitative 
insights into the most likely path to failure of a system. 

For presentation purposes and for the rest of this section, we assume that = = 
£N | 0 as N ^ оо. Consider a heterogeneous test portfolio composed initially of 
М = 200 names. Let us assume that we can separate the names in the portfolio 
into three types: Type A is 16.67 96 of the names, Type B is 33.33 96 of the names 
and Type C is 50% of the names. For presentation purposes, we assume that all 
parameters but the contagion parameter are the same among the different types. In 
particular, we have the following choice of parameters. 

It is instructive to compare the different cases, based on whether there are conta- 
gion effects in the default intensities or not. In particular, we compare two different 
cases, (a) Systematic risk only: 85 Æ 0, ВС = 0, and (b) Systematic risk and conta- 
gion: 85 Æ 0, ВС 0. In each case, the time horizon is Т = 1. 

Using the methods of Sect. 3, one can compute that the typical loss in such a pool 
at time Т = 1. If contagion effects are not present, i.e., if BS = BS = BE = 0, 
then the typical loss in such a portfolio at time T = 1 is Lr = 42.5 %. If on the 
other hand, contagion (feedback) effects are present and the ВС parameters take the 
values of Table 1, then the typical loss in such a portfolio at time T = 1 has been 
increased to Lr = 72.1 %. In Fig.3, we plot the large deviations rate functions for 
each of the two different cases. As we saw in the beginning of this section, the rate 
function governs the asymptotics of the tail of the loss distribution. Notice that in 
every case, the rate function is convex and it becomes zero at the corresponding law 
of large numbers. 

Moreover, since the contagion parameter of Type A is higher than the contagion 
parameter for Type B or C, one expects that names of Type A will be more prompt 
to the contagious impact of defaults. Indeed, after computing the rate function and 
the associated extremals, as defined by large deviations theory, one gets the most 
likely paths to failure as seen in Figs. 4 and 5. The g(t) trajectories correspond to the 
contagion extremals for each of the three types, whereas the y(t) corresponds to the 
systematic risk extremal. 

One can make two conclusions out of Figs. 4 and 5. The first conclusion is related 
to the o extremals (Fig. 4). We notice that at any given time f, the extremal for Type 


Table 1 Parameter values for a test portfolio composed of three types of assets 


X 
Type A 0.5 2 0.5 0.2 1 1 10 
Туре В 0.5 2 0.5 0.2 1 1 3 
Type C 0.5 2 0.5 0.2 1 1 1 
карк 
We take ey = JN 
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Fig.3 Rate function governing the log-asymtptotics of the tail of the loss distribution 
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Fig. 4 Optimal y(t) trajectories for the three different types in the pool for t є [0, 1] and £ = 0.81 


A is bigger than the extremal for Type B, which in turn is bigger than the extremal of 
Type C. This implies that unlikely large losses for components of Type A are more 
likely than unlikely large losses for components of Type B, which are more likely 
than large losses for components of Type C. Thus, components of Type A affect 
the pool more than components of Type B, which in turn affect the pool more than 
components of Type C even though Type A composes 16.67 % of the pool, whereas 
Type B, composes 33.33 % of the pool and Type C composes 50 % of the pool. The 
second conclusion is related to the y extremals (Fig.5). We notice that the effect 
of the systematic risk is most profound in the beginning but then its significance 
decreases. 
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Fig. 5 Comparing optimal y (t) trajectories in the case of absence and presence of the contagion 
effects for t € [0, 1] and £ = 0.81 


Namely, if a large cluster were to occur, the systematic risk factor is likely to 
play an important role in the beginning, but then the contagion effects become more 
important. Assets of Type A are likely to contribute to the default clustering effect 
more, followed by assets of Type B and the ones that will contribute the least to the 
default cluster are assets of Type C. 

As it is also seen in the numerical experiments done in [26], the large deviations 
analysis help quantify the effect that the contagion and the systematic risk factor have 
on the behavior of the extremals (the most likely path to failure). An understanding 
of the role of the preferred paths to large default rates and the most likely ways in 
which contagion and systematic risk combine to lead to large default rates would 
give useful insights into how to optimally hedge against such events. 

Letus next proceed by motivating the development of the large deviations principle 
for the default timing model (2)-(5) that is considered in this paper. 

We denote scenarios, i.e., defaults, that are not in [0, Т] by an abstract point x not 
in [0, Т] and define the Polish space 


T = [0, T] U {x} 


To motivate things, let's first assume for simplicity that ВС = 6° = 0 and that 
the system is homogeneous, i.e., that p" — p for all n. Define 


di, = —a(, — Mt Fe /A,dW, t>0 
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with Ag = А. This Feller diffusion will represent the conditional intensity of a 
"randomly-selected" component of our (homogeneous and independent) system. 
Define the measure ио € (Кү) by setting 


1 
ио!0, t] = 1 — [е [-/ zl 
0 


for all t > 0; uo is the common law of the default times r,'s. 
In the independent case, i.e., when 8С = 0, standard Sanov's theorem [44], 
implies that {d L^ } yen has a large deviations principle with rate function 


dv 
Н (v, ро) =] In —— (t)v(dt) 
єт duo 


if v « ио and Н (v, цо) = œ if v « ио (1.е., Н (v, uo) is the relative entropy of v 
with respect to uo). By the contraction principle, the rate function for IN is 


IPS (p) = inf {H (v, ug) : v € Z (R4), v[0, t] = e(t) for allt € [0, T] and v[0, T] = £) 


In the independent case, we can actually compute both the extremal ф that achieves 
the infimum and the corresponding rate function /'’(¢) in closed form. 
Assume that ио[0, T] є (0, 1) and є (0, 1). Fix v є A(T) such that v[0, T] = 
£. Define 
шо(А N [0, TD АП, T]) 


uo,- (A) = - ue T] - and v-(A) = Г 


for all A є Z[0, T]. Then w_ and v_ are in Z[0, Т]. We can write that 


H(v, po) = £l hO adu a 
И Я (0a pots) 


vix (19) 


where / is entropy on Y[0, T]. We can minimize the A term by setting v- = uo.-—., 
and we get that 


ind,/ __ t ра 
Гі (фу = £1n i + (1— In TN (20) 


£ = 
= £ln +(1— £)In А 
mari р ао 


Systemic Risk and Default Clustering for Large Financial Systems 547 


This is in fact obvious; ЕХ = + jum liz, «7, and in this case the 1; <7)’s are i.i.d. 
Bernoulli random variables with common bias j40[0, Т]. The rate function 7 indc p) 
of (20) is the entropy of Bernoulli coin flips. Of more interest, however, is the optimal 
path. In setting v_ = u- ір (19), we essentially identify the optimal path 


HoI9, t] 


— £————, 
Мо 


where the last relation holds since we also require (Т) = £. 

It turns out that one can extend this result to give a generalized Sanov's theorem 
for the case ВС > 0, where dL feeds back into the dynamics of the A"'s. The 
case В > 0 can be treated using a conditioning argument and the well developed 
theory of large deviations for small noise diffusions. For the heterogeneous case, 
one needs an additional variational step which minimizes over all the possible ways 
that losses are distributed among systems of different types. Even though an explicit 
closed form expression for the extremals and for the corresponding rate function is 
no longer possible, one can still rely on numerically computing them. Let us make 
this discussion precise. 

To fix the discussion, let us assume (see [26] for the general case) that the exoge- 
nous risk X is of Ornstein-Uhlenbeck type, i.e., 


XQ = х, 


Let W* be a reference Brownian motion. Fix a name in the pool p = (Ao, о, A, c, 
ВС, Вэ) є P and time horizon T > 0. 

The Freidlin-Wentzell theory of large deviations for SDE's gives us a natural 
starting point. In the Freidlin-Wentzell analysis, a dominant ODE is subjected to a 
small diffusive perturbation; informally, the Freidlin-Wentzell theory tells us that 
if we want to find the probability that the randomly-perturbed path is close to a 
reference trajectory, we should use that reference trajectory in the dynamics. This 
leads to the correct LDP rate function for the original SDE. If we want to find the 
asymptotics of the probability that Сви x афр, єуах ғ а y) for some absolutely 
continuous functions ф and у, i.e., o, v € AC ([0, T], IR), we should consider the 
stochastic hazard functions 


di?" = адо У — jdt c oJ AP aw? + вСаф(@) + ВЗА ау) — t e [0, T] 


This will represent the conditional intensity of a “randomly-selected” name in our 


pool. Define next 
t 
fey) =B] af p|- f. i|]. 
= 
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where, we have used the superscript p to denote the dependence on the particular 
type. Then for every t є [0, T] we have that 


1 t t 
J DOE =1—Е E [-/ АГ =P { / re ds > | 
5=0 i s=0 s=0 


where e is an exponential(1) random variable which is independent of W*. In other 
words, TE i is the density (up to time Т) of a default time whose conditional intensity 


is 4%”. In fact, due to the affine structure of the model, we have an explicit expression 
for Lu (see Lemma 4.1 in [26]). 


For given trajectories o and y іп АС([0, Т]; R), define Bog € A(T) as 


ЖЫ) 


тєАГ[О,Т] 


Т 
тажа f fou] 


for all A є A(T). 
At a heuristic level one can derive the large deviations principle as follows. Let 
us assume that we can establish that 


PIL” ~ ф|Х® ~ y} © exp [-N1° (p, V)] 
and that {х^ —ENX,N < оо} also has large deviations principle in С([0, Т]; IR) 
with action functional Jy; i.e., 
N 1 
P[x" ~ v] = exp | - xe) 
EN 


as N // oo. Then, we should have that 
1 
P{LY ~ o, XP ~ у) ~ exp ЕХ у) – gen 
N 


In fact, the previous heuristics can be carried out rigorously and in the end one derives 
the following rigorous large deviations result. 


Theorem 5.1 (Theorem 3.8 in [26]) Consider the system defined in (2)-(5) with 
limy—+oo £y = 0 such that іту. оо Ne, = c € (0, оо) and let T < oo. Under 
the appropriate assumptions the family {LX , N € N} satisfies the large deviation 
principle, with rate function 


I'(£) — inf lro. Y):p € C(Px[0o, TD. y €C (L0, T), v(0) = pp, 0) = 0, 


joja e(p. s)U (dp), G(T) = Е 
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where if € AC (P x [0, T], V € AC ([0, TD , (0) = ф(р, 0) = 0, then 
пер) = fH (o). uy) Uap) + Je) 
i P Ow c 


and І(ф, ү) = œ otherwise. Here, Jx (V) is the rate function for the process 
(ey X", № < oo). Namely, for y € AC ([0, T]; R) with (0) = О we have 


T 
Jk 2 1 | pio +уу as 
2 Jo 


and Jx (V) = oo otherwise. I'(£) has compact level sets. 


If the heterogeneous portfolio is composed by K different types of assets with 
homogeneity within each type, then Theorem 5.1 simplifies to the following expres- 
sion. 

For ё, o, v € АС([0, Т] let us define the functional 


: | 
E(t) : 1—é(T) 

PE, o, рз] 1 (t)dt +1 (1 (T) 

Е (Gi) (a e) | 


Due to the affine structure of the model, we have an explicit expression for s " (see 
Lemma 4.1 in [26]). 

Assume that к; % of the names are of type А; with i = 1,..., К and yy Kj = 
100. Setting g(p, s) = pe ТФА; (5) Хүрд, } (D). we get the following simplified 
expression for the rate function 


K 


K 
{ i А 1 
I'(0) = inf уз БР Aj o + LI ЧО: OO = Э 


Ki 


100 9A; (t) for every t € [0, Т] 


Ф(Т) = £, рд, (0) = V (0) = 0, фд,, y € АС@0, T]) for every i = 1, ..., K 


An optimization algorithm can then be employed to solve the minimization problem 
associated with /'(£) and compute the extremals фа, fori = 1,..., К and y. This 
is the formula that the numerical example presented in Figs.4 and 5 was based on. 
In the numerical example that was considered there we had three types, 1.е., К = 3. 

The large deviations results have a number of important applications. Firstly, they 
lead to an analytical approximation of the tail of the distribution of the failure rate 
LN for large systems. These approximations complement the first- and second- order 
approximations suggested by the law of large numbers and fluctuations analysis of 
Sects. 3 and 4 respectively and facilitates the estimation of the likelihood of systemic 
collapse. Secondly, the large deviations results provide an understanding of the “pre- 
ferred" ways of collapse, which can also be used to design "stress tests" for the 
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system. In particular, this understanding can guide the selection of meaningful stress 
scenarios to be analyzed. Thirdly, they can motivate the design of asymptotically 
efficient importance sampling schemes for the tail of the portfolio loss. We discuss 
some of the related issues in Sect. 6. 


6 Monte Carlo Methods for Estimation of Tail Events: 
Importance Sampling 


Suppose we want to computationally simulate Bn > £}, where lim yoo р^ > 
£} = 0 again holds. Accurate estimates of such rare-event probabilities are important 
in many applications areas of our system (2)-(5), including credit risk management, 
insurance, communications and reliability. Monte Carlo methods are widely used 
to obtain such estimates in large complex systems such as ours; see, for example, 
[29, 30, 45—51]. 

Standard Monte Carlo sampling techniques perform very poorly in estimating 
rare events (for which, by definition, most samples can be discarded). Importance 
sampling, which involves a change of measure, can be used to address this issue. 
In general, large deviations theory provides an optimal way to ‘tilt’ measures. The 
variational problems identified by large deviations usually lead to measure transfor- 
mations under which pre-specified rare events become much more likely, but which 
give unbiased estimates of probabilities of interest; see for example [28, 34, 52—56]. 

Let I" be any unbiased estimator of PL > £} that is defined on some prob- 
ability space with probability measure Q. In other words, Г“ is a random variable 
such that EQT'" = Ps > £}, where EQ is the expectation operator associated 
with E. In our setting, it takes the form 


NL] dP 
r = {Lr >8 GQ’ 


where w is the associated Radon-Nikodym derivative. 


Importance sampling involves the generation of independent copies of ГУ under 
Q; the estimate is the sample mean. The specific number of samples required depends 
on the desired accuracy, which is measured by the variance of the sample mean. 
However, since the samples are independent it suffices to consider the variance of 
a single sample. Because of unbiasedness, minimizing the variance is equivalent to 
minimizing the second moment. An application of Jensen's inequality, shows that if 


1 
lim inf —— In AQ Ny = -27'(2), 


N—oo 


then I achieves this best decay rate, and is said to be asymptotically optimal. One 
wants to choose Q such that asymptotic optimality is attained. 
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To motivates things let us assume for the moment that ВС = 85 = 0 and that 
the system is homogeneous, i.e., that p" — p for all n. In the independent and 
homogeneous case, E, = l(;-r, are 11.1. random variables such that for every 
t € [0, T] 


ЗЕЛЕ ЕЕ ер [28%] = КОТ 
0 


For notational convenience, we shall define 


T 
pe fo,o(s)ds 


It is easy to see that, 
NLY ~ Binomial(N, р) 


To minimize the variance, we need to increase the probability of defaults. Define 


AN (ө; t) = In [2 | 


А simple computation shows that 
А; = lim LAWON =n (p (e? — 1) +1) 
N—oo N 


Define 
pe? 


Po = үе т) 


Clearly ро — p. Notice that the density of a Binomial(N, p) with respect to a 
Binomial(N, po) is 


N = 


a- M(E Qz2) оне е 


po 1— po п=1 
ем(—917+А(0;Т)) 


Therefore, for 0 fixed, the suggestion is to simulate under a new change of measure, 
under which N LY ~ Binomial(N, po) and to return the estimator 


M 


lu retten 
L7” 
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It is clear that this estimator is unbiased. We want to choose 0 that minimizes the 
variance, or equivalently the second moment. For this purpose, we define the second 
moment 


006,6) = Бог? = Eo | цид) 
Notice that 
1 1 i | 
-у0@%.0)>—-2-М (—0£+ A@; T)) = 2(0€ — А(Ө; T)) 


Due to convexity of A(0; T), we have that the maximizer over 0 € [0, oo) of the 
lower bound is at 0* such that dA) = £. In particular, (recall that D = p) 


we have 


£(1— p) : 
9* — In pü-0* fl> p 
0, if£< p 


This construction means that under the new measure, we have 
Pox {Tn < Т} = pox = €. 
In fact, we have the following theorem. 


Theorem 6.1 Let0* > 0 such that menn = £. Then asymptotic optimality holds, 
in the sense that 


1 | 
lim —— In O(£,6*) = 21'"®'(# 
Imc. n Q(£, 07) (0) 
where І! (£) is defined іп (20). 


Proof By Jensen's inequality we clearly have the upper bound. Namely, for every 
0 € [0, оо) 


1 : 
lim sup — = In 0600,0) < 21*(0) (21) 


Моо 


Now, we need to prove that the lower bound is achieved for 0 = 0*, i.e., that 


fiint dn Q(£, 0*) > 21d (p) (22) 
Noo N ' T 
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Recalling that 0* — In A and p — Ts fo.o(s)ds, we easily see that 


E 1 n 
lim inf — In 006,0") > 2 (9*t — А(Ө*; Т)) 
=2 (o*t — In (pe —1)+ 1)) 


£ 1-4 
- 3 (tm +1- 81n ) 

p 1—р 
= gna 


This concludes the proof of the theorem. 


In the heterogeneous case, i.e., if p" can be different for each n € N, then N re = 


у —1 Цо, <т) is no longer Binomial, but itis a sum of independent (but not identically 
distributed) Bernoulli random variables with success probability 


T 
Pn =] fe o(s)ds 


indexed by n. Due to independence, similar methods as the one described above can 
be used to construct asymptotically efficient importance sampling schemes in the 
heterogeneous case. 

The scheme just presented essentially amounts to a twist in the intensity of the 
defaults. However, in contrast to the independent case, i.e., when В с = 8° = 0, the 
situation in the general dependent case ВС, 85 5 0 is more complicated. Notice also 
if at least one of the BE % is not zero, then the model (2)-(5) does not fall into the 
category of the doubly-stochastic models, so techniques as the ones used in [45] do 
not apply. Also, implementation of interacting particle schemes for Markov Chain 
models as the ones developed in [29, 47] do not readily apply for such intensity 
models. The re-sampling schemes of [48] could apply in this setting, but one would 
need to construct an appropriate mimicking Markov Chain, something which is not 
clear how to do in the current setting. 

We briefly present here an importance sampling scheme for the case that there 
exists at least one BE z 0 and also applies independently of whether the systematic 
effects are present in the model or not. The suggested measure change essentially 
mimics the principal idea behind the measure change for the independent case. To 
be more precise, one directly twists the intensity of NLY = px li, -r)- 

Let {5} be the arrival times of NLY and notice that [4 > t = { 5гемт < T}. 
Let M? = унуу and p > 1 be some progressively measurable twisting process. 
Then, define the measure © via the Radon-Nicodym derivative 


Srp S 
Zy = e- fo ово) fol" (1-08) Y ху мраз. 
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тем N 
It is known that if IE [e 241 (i) < осо, then Q defined by 45 = ZN 


is a probability measure and it can be shown that № LN admits Q—intensity 
oN bx А M? on the interval [0, Srex1). 

This construction gives us some freedom into choosing appropriately the twisting 
process oN . Different choices of the twisting process oN are of course possible. For 
tractability purposes we restrict attention to a one-parameter family and set 


BN 


ӨМ = —V——— 
~~ N 
5.4 хм? 


5 


+1. 


For any В > 0 and under the measure induced by Zy, i.e., under Qg, the process 
NLN has intensity x 124 M? + BN оп [0, Steny), i.e., it amounts to an additive 


n= 
shift of the intensity. Thus, 6 is a superimposed default rate and its role is to increase 
the default rate in the whole portfolio. 

The purpose then is to optimize the limit as N — oo of the upper bound of the 
second moment of the resulting estimator over В. This is the measure change that is 
investigated in [57], and it is shown there that there is a choice of В = 8* for which 
asymptotic optimality can be established. Namely, there is a choice of 8 = В* that 
minimizes the second moment of the estimator in the limit as N — oo. We refer the 
interested reader to [57] for implementation details on this change of measure for 
related intensity models and for corresponding simulation results. 


7 Conclusions 


We presented an empirically motivated model of correlated default timing for large 
portfolios. Large portfolio analysis allows to approximate the distribution of the loss 
from default, whereas Gaussian corrections make the approximation valid even for 
portfolios of moderate size. The results can be used to compute the loss distribu- 
tion and to approximate portfolio risk measures such as Value-at-Risk or Expected 
Shortfall. Then, large deviations analysis can help understand the tail of the loss 
distribution and find the most-likely paths to systemic failure and to the creation of 
default clusters. Such results give useful insights into the behavior of systemic risk 
as a function of the characteristics of the names in the portfolio and can be also 
potentially used to determine how to optimally safeguard against rare large losses. 
Importance sampling techniques can be used to construct asymptotically efficient 
estimators for tail event probabilities. 
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Estimation of Volatility Functionals: 
The Case of a ./n Window 


Jean Jacod and Mathieu Rosenbaum 


Abstract We consider a multidimensional Itó semimartingale regularly sampled on 
[0, г] at high frequency 1/A,, with A, going to zero. The goal of this paper is to 
provide an estimator for the integral over [0, т] of a given function of the volatility 
matrix, with the optimal rate 1/4/A, and minimal asymptotic variance. To achieve 
this, we use spot volatility estimators based on observations within time intervals of 
length kn Ay. In [5], this was done with kn — oo and kn / ^, — 0, andacentral limit 
theorem was given after suitable de-biasing. Here we do the same with the choice 
kn x 1A As. This results in a smaller bias, although more difficult to eliminate. 


Keywords Semimartingale - High frequency data - Volatility estimation * Central 
limit theorem - Efficient estimation * Estimation of volatility functionals - Asymptotic 
aspects 
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1 Introduction 


Consider an Itó semimartingale X;, whose squared volatility c; (a d x d matrices- 
valued process if X is d-dimensional) is itself another Itó semimartingale. The process 
X is observed at discrete times iA, for i = 0, 1,..., the time lag A, being small 
(high-frequency setting) and eventually going to 0. The aim is to estimate integrated 
functionals of the volatility, that 15 h g(cs) ds for arbitrary (smooth enough) func- 
tions g, on the basis of the observations at stage n and within the time interval [0, t]. 
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This is of course a quite well understood problem when g is the identity function. 
In particular, when X is one-dimensional and continuous, Js g(cs) ds corresponds 
then to the integrated (squared) volatility, which can be efficiently estimated using 
the so-called realized volatility, that is the sum of the squared increments of X. How- 
ever, many other functions g are of interest. For example, in the case of the realized 
volatility mentioned above, the quantity i e ds, called quarticity and corresponding 
to g(x) — x?, appears in the asymptotic variance of the estimator. Therefore, esti- 
mating the quarticity becomes necessary if one wants to build confidence intervals 
for the integrated volatility. Actually, in the context of volatility estimation, for most 
statistical procedures, the asymptotic variance is a combination of terms of the form 
|) g(cs) ds, see [4]. Hence the statistician needs to be able to estimate such quanti- 
ties. Note that the functions g involved in limiting variances are often polynomial. 
Nevertheless, more complicated expressions may also be found, in particular in the 
multi-dimensional setting in the presence of jumps. We refer to [5] for more details on 
the motivation for estimating general integrated functionals of the volatility process. 

In [5], we have exhibited estimators which are consistent and asymptotically 
optimal, in the sense that they asymptotically achieve the best rate 1/4/Л,, and 
also the minimal asymptotic variance in the cases where optimality is well-defined 
(namely, when X is continuous and has a Markov type structure, in the sense of [2]). 
These estimators have this rate and minimal asymptotic variance as soon as the jumps 
of X are summable, plus some mild technical conditions. 

The aim of this paper is to complement [5] with another estimator, of the same 
type, but using spot volatility estimators based on a different window size. In this 
introduction, we explain the differences between the estimator in [5] and the one 
presented here. 

For the sake of simplicity, we consider the case when X is continuous and one- 
dimensional (the discontinuous and multi-dimensional case is considered later), that 
is of the form 


t t 
x -xe« | beds | os dWs 
0 0 


and с; = e is the squared volatility. Natural estimators for V (g), — i g(cs) ds are 


[t/An]—kn+1 kn-1 
V (g); = An ME gc), where (24 = k ^ >, (X (ij) Aj m Х@+у—1)л„)° 
i=l nU чу 


(1.1) 
for an arbitrary sequence of integers such that kn — oo and kn A, — 0. One knows 


that vo» V (д), (when g is continuous and of polynomial growth). 
The variables c? are spot volatility estimators, and according to [4] we know that 
©} JAn] estimates c;, with a rate depending on the “window size” kn. The optimal rate 


1/ AE 4 is achieved by taking kn = 1/ /An.! When kpn is smaller, the rate is Ky, 


By kn х 1/A/ As, we mean a1/A/ Ag < kn € a2/A/ An, for some a; > 0 and a» > 0. 
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and the estimation error is a purely “statistical error"; when К, is bigger, the rate is 

kn An and the estimation error is due to the variability of the volatility process c; 
itself (its volatility and its jumps). With the optimal choice ky, = 1/4/A;, the estima- 
tion error is a mixture of the statistical error and the error due to the variability of c;. 

In [5], we have used a “small” window, that is kn «& 1/4/ A5. Somewhat surpris- 
ingly, this allows for optimality in the estimation of i g(cs) ds (rate 1/./Ay and 
minimal asymptotic variance). However, the price to pay is the need of a de-biasing 
term to be subtracted from V (g)", without which the rate is smaller and no Central 
Limit Theorem is available. 

Here, we consider the window size kn =< 1/4/A;. This leads to a convergence 
rate 1/./A,, for V (g)" itself, and the limit is again conditionally Gaussian with the 
“minimal” asymptotic variance, but with a bias that depends on the volatility of 
the volatility c;, and on its jumps. It is however possible to subtract from V (g)" a 
de-biasing term again, so that the limit becomes (conditionally) centered. 

Section2 is devoted to presenting assumptions and results, and all proofs are 
gathered in Sect. 3. The reader is referred to [5] for motivation and various comments 
anda detailed discussion of optimality. However, in order to make this paper readable, 
we basically give the full proofs, even though a number of partial results have already 
been proved in the above-mentioned paper, and with the exception of a few well 
designated lemmas. 


2 The Results 


2.1 Setting and Assumptions 


The underlying process X is d-dimensional, and observed at the times iA, for i = 
0, 1, ..., within a fixed interval of interest [0, t]. For any process we write А?Ү = 
Yi A, = Y(i-1A, for the increment over the ith observation interval. We assume that 
the sequence A, goes to 0. The precise assumptions on X are as follows. 

First, X is an It6 semimartingale on a filtered space (2, F, (775); 0, P). It can be 
written in its Grigelionis form, as follows, using a d-dimensional Brownian motion 
W and a Poisson random measure и on R+ x Е, where Е is an auxiliary Polish 
space and with the (non-random) intensity measure v(dt, dz) = dt & A(dz) for 
some c-finite measure А on £: 


X, = Xo + fo bsds + fp os dWs- fo Se ots: zZ) lyon) бё и) (ds, dz) 
+ Јо Je 96. 2) Loc op n (5, dz). 

(2.1) 

This is a vector-type notation: the process b; is R¢-valued optional, the process о; 


is R? @ RZ-valued optional, 6 = (w, t, z) is a predictable IR¢-valued function on 
Q x R+ x E and ||. || is the euclidean norm on R^. 
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The spot volatility process c; = 0,0; (* denotes transpose) takes its values in the 
set м] of all nonnegative symmetric d x d matrices. We suppose that c; is again 
an Itó semimartingale, which can be written as 


c; = co + fg bs ds + fo Fs dWs + fo Je 9G. 2) lig pen (6 — ds, dz) 


+ Jo Је 9G. 2 1ugqs iei HAS. dz), 
(2.2) 


with the same W and yp as in (2.1). This is indeed not a restriction: if X and c are 
two It6 semimartingales, we have a representation as above for the pair (X, c) and, 
if the dimension of W exceeds the dimension of X, one can always add fictitious 
component to X, arbitrarily set to 0, so that the dimensions of X and W agree. 

In (2.2), b and 5 are optional and д is as 6; moreover b and д are RY -valued. 
Finally, we need the spot volatility of the volatility and “spot covariation” of the 
continuous martingale parts of X and c, which are 


d d 
~ij,kl ~ij,m~kl,m ~i, jk ilc jk 
с = > Of,’ Op, G = > 0,0; `. 


m=1 1=1 


The precise assumptions on the coefficients are as follows, with r a real in [0, 1). 


Assumption (A’-r): There are a sequence (J;) of nonnegative bounded A-integrable 
functions on Е and a sequence (7,,) of stopping times increasing to оо, such that 


t<tmw) — 160, ғ, 2) AL+ бш, О ^ 1 < In. 


Moreover, the processes b; = b, — / (2, 2) осе, 2911) A(dz) (which is well 
defined), €; апас, are càdlàg or càglàd, and the maps t > (ш, t, z) are càglàd (recall 
that 6 should be predictable), as well as the processes b, + f бе, 2) (KSl, z)l) — 
1 ТЕЛЕ i) A(dz) for one (hence for all) continuous function к on R with compact 
support and equal to | on a neighborhood of 0. 

The bigger r, the weaker Assumption (A'-r), and when (A’-0) holds the process 
X has finitely many jumps on each finite interval. The part of (A'-r) concerning the 
jumps of X implies that >, ||AXs||" < oo a.s. for all £ < оо, and it is in fact 
"almost" implied by this property. Since r — 1, this implies Y <р ПАХ, < œ a.s. 


Remark 2.1 (A-r) above is basically the same as Assumption (A-r) in [5], albeit 
(slightly) stronger (hence its name): some degree of regularity in time seems to be 
needed for b, С, c’, 6 in the present case. 


2.2 A First Central Limit Theorem 


For defining the estimators of the spot volatility, we first choose a sequence К„ of 
integers which satisfies, as n — oo: 
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0 € (0, оо), (2.3) 


and a sequence и» in (0, оо]. The М -valued variables er are defined, component- 
Wise, as 


kn—1 
1 1 
ra m LA >, Ap X АХ" Цуд” Хин} (2.4) 
п п j=0 


and they implicitly depend on Ay, kn, un. 
P А А ПИЕ" 
One knows that Ch [An] €t for any f, and there is an associated Central Limit 


Theorem under (A’-2), with rate 1/ AM ^: the choice (2.3) is optimal, in the sense 
that it allows us to have the fastest possible rate by a balance between the involved 
“statistical error" which is of order 1/./k,, and the variation of c; over the interval 
[t,t + Kk; ^5], which is of order Vkn A; because с; is an Itó semimartingale (and 
even when it jumps), see [1, 4]. 

By Theorem 9.4.1 of [4], one also knows that under (A’-r) and if uj х A7 for 
р—1 | 


3) we have 


some w € c= 5 


[t/An]—kn +1 


t 
Vig); := An >, 4@) =} Vo - | g(cs) ds (2.5) 
i=l 


(convergence in probability, uniformly over each compact interval; by convention 
Y» v = Oif b < a), as soon as the function g on M} is continuous with 
ig(x)| x K(1 + ||x||?) for some constants K, p. Actually, for this to hold we need 
much weaker assumptions on X, but we do not need this below. Note also that when 
X is continuous, the truncation in (2.4) is useless: one may use (2.4) with и, = оо, 
which reduces to (1.1) in the one-dimensional case. 

Now, we want to determine at which rate the convergence (2.5) takes place. This 
amounts to proving an associated Central Limit Theorem. For an appropriate choice 
of the truncation levels, such a CLT is available for V (g)", with the rate 1/ МА», but 
the limit exhibits a bias term. Below, g is a smooth function on МЎ, and (ће two first 
partial derivatives are denoted as 0jxg and 9 Img Since any x € M} has d? com- 


ponents x/*. The family of all partial derivatives of order j is simply denoted as д/ g. 
Theorem 2.2 Assume (A’-r)forsomer < 1. Let g bea C? function on Mi such that 

19/900) = KA+ lx’, — 7 =0,1,2,3 (2.6) 
for some constants К > 0, p > 3. Either suppose that X is continuous and 


и [ АЎ — оо for some = < 1/2 (for example, un = oo, so there is no trunca- 
tion at all), or suppose that 


(0:7) 
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Then we have the finite-dimensional (in time) stable convergence in law 


1 
—— (V V Sa du qo A ZR 

Nd (д, — (д) 25 t 

where Z is a process defined on an extension (Q, F, CEiso, P) of 


(О, F, (F;):>0, P), which conditionally on F is a continuous centered Gaussian 
martingale with variance 


E((Z | F) = У [ Ojkg (Cs) Omg es) (cl ck" + сі") ds, 02.8) 


j,k,l,m=1 


and where, with the notation 
1 
G(x, у) = f (g(x + wy) – шд(х + у) -0—w)gG))dw, (2.9) 
0 


we have 


А} = —£ (g(co) + g(cr)) 
d j 
Aj-—3 У h” jm 98) (сз Лот + ed" cH) as 


j,k,l,m=1 
Z 0 ~jk, im 3 
A; — 12 ho jk, Im (Cs) ©$ 
j,k,l,m=1 
Аў = G(cs—, Acs). 
22 


Note that |G (x, y)| < K(1 + |х|)? ||y||?, so the sum defining A? is absolutely 
convergent, and vanishes when с, is continuous. 


Remark 2.3 The bias has four parts: 

(1) The first one is due to a border effect: indeed, the formula giving V (g)7 contains 
[t/An] — kn + 1 summands only, whereas the natural (unfeasible) approximation 
An pba g(c(i-1)^,) contains [t/ An] summands. The sum of the lacking kn — 1 
summands is of order of magnitude (К, — 1)A,, which goes to 0 and thus does not 
impair consistency, but it creates an obvious bias after normalization by 1/V An. 
Removing this source of bias is straightforward: since g(cs) is “under-represented” 
when s is close to О or to t, we add to V(g)/ the variable 


(kn =, DA, 


2 (gC) + GH As d-ks41))- 


Of course, other weighted averages of g(c7 ) for i close to 0 or to [t/An] — kn + 1 
would be possible. 
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(2) The second part A? is continuous in time and is present even in the toy model 
given by X, = yc W, with c a constant and A, = I and T — 1. In this simple case, 
the interpretation is as follows: instead of taking the “optimal” g(c;) for estimating 
g(c), with & = У" ., (A7 Х)?, one takes 1 oia 9(С;) with С" a "local" estimator 
of c. This adds a statistical error which results in a bias. Note that, even in the general 
case, this bias would disappear, were we taking in (2.3) the (forbidden) value 0 = оо 
(with still kn An — 0), at the expense of a slower rate of convergence. 

(3) The third and fourth parts A? and A^ are respectively continuous and purely 
discontinuous, due to the continuous part and to the jumps of the volatility process 
c; itself. These two biases disappear if we take 0 = 0 in (2.3) (with still А, — oo), 
again a forbidden value, and again at the expense of a slower rate of convergence. 

The only test function g for which the last three biases disappear is the identity 
g(x) = x. This is because, in this case, and up to the border terms, V (g)/ is nothing 
but the realized quadratic variation itself and the spot estimators С” actually merge 
together and disappear as such. 


Remark 2.4 Observe that (2.7) implies г < 1. This restriction is not a surprise, since 
one needs r < l in order to estimate the integrated volatility by the (truncated) 
realized volatility, with a rate of convergence 1/4/A,. When r = 1, it is likely that 
the CLT still holds for an appropriate choice of the sequence un, and with another 
additional bias, see e.g. [6] for a slightly different context. Here we let this borderline 
case aside. 


2.3 Estimation of the Bias 


Now we proceed to “remove” the bias, which means subtracting consistent estimators 
for the bias from V (д). As written before, we have 


kny А Р 

J 

Аг = T (901) + Iaat) — Ar (2.10) 
(this comes from sq and Tirna e plus cj. = с; a.s.). Next, 


observe that A? = 1 V (h) for the test function л defined on Mj by 


d 
1 2 jl „К j kl 
h(x) — 2 > Os mI) [e x! 4 х7" x ). 
j,k,l,m=1 


Therefore 


АТ? = V (hy! — A2, Q.11) 


kn v An 
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The term A? involves the volatility of the volatility, for which estimators have 
been provided in the one-dimensional case by Vetter [7]; namely, if d — 1 and under 
suitable technical assumptions (slightly stronger than here), plus the continuity of 
X; and с;, he proves that 


[t/An]—2kp +1 


3 2 
Kn > (Gi + ©) 


ї=1 


converges to Jn (6; + E (с,)2) ds. Of course, we need to modify this estimator here, 


in order to include the function 0*g in the limit and account for the possibilities of 
having d > 2 and having jumps in X. We propose to take 


[t/ An]— 2kn+1 
3 УА jk. rond d 
br UMP У бә) шуп qm 


8 С +, i 
j,k,l,m=1 


(2.12) 


When X and c are continuous, one may expect the convergence to A? — 5 A? (observe 


that Ума ~ oe 6), and опе may expect ће same when X jumps i c is still 
continuous. кош in (2.4) the truncation basically eliminates the jumps of X. In 
contrast, when c jumps, the limit should rather be related to the "full" quadratic 


variation of c. Indeed we have the following theorem. 


Theorem 2.5 Under the assumptions of Theorem 2.2, for all t > 0 we have 


1 
Дз РУ. eu SAIA HAM, 
where 
А =0 >) G'(cs-, Acs) 
51 
апа 
С'(х, У) = —< >> fi O5, im IE) + дк im g(x + (1 — w)y)) ш? y/* y !" dw. 


8 k,l,m 
(2.13) 


At this stage, it remains to find consistent estimators for А? — А/, which has the 
form 
At Ай 20 у G"(c-, Acs), where С" = G — G'. 


m 
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More generally, we aim at estimating 


ҮР), = У Ё(с;—, Acs), 


s<t 


at least when the function F on M} x Ма, where Ма is the set of all d x d 
matrices, is C! and |F (x, y)| < K||y||? uniformly in x within any compact set, as is 
the function G" above. 
The solution to this problem is not as simple as it might appear at first glance. We 
first truncate from below, taking any sequence и” of truncation levels satisfying 
и! 1 


Z — oo forsome a є (0, =). (2.14) 


и! 0, ; 
Ae 8 


n 


Second, we resort on the following trick. Since С? is “close” to the average of c; over 
the interval (A5, (i + kn) An], we (somehow wrongly) pretend that, for all j: 


ds € ((/ — DE, An, jk; ^n] with Ас, | > и» 


^n ^n / 7h 7n 
e IPs Cj 2e, > Un AC; Ck, — C(j-2)* 


XT ^h п тп ei п 
Пб рур, — узу, V ео, 7 бу-у! < MCF, = 0-2), l- 


The condition (2.14) implies that for n large enough there is at most one jump of size 
bigger than и” in each interval ((i — 1) A5, (i — 1+k,)A,] within [0, г], and no two 
consecutive intervals of this form contain such jumps. Despite this, the statement 
above is of course not true, the main reason being that С” and c? do not exactly 
agree. However it is "true enough" to allow for the next estimators to be consistent 
for V(F);: 


Set bs An]-3 
VE) = 3 F( зуы+1,; FO) иб" envi? 21и, «19721 "T 
HL —— ` p i! ` 
where OFC = Cj. 41 — Суук у: 


Since this is a sum of approximately [t/k, An] terms, the rate of convergence of 
V(F)? towards V(F), in law is probably 1/ An! à only. However, here we are looking 
for consistent estimators, and the rate is not of concern to us. Note that, again, the 
upper limit in the sum above is chosen in such a way that V(F)/ is computable on 
the basis of the observations within the interval [0, т]. 


Theorem 2.6 Assume all hypotheses of Theorem 2.2, and let F be a continuous 
function on R4 x К satisfying, with the same p > 3 as in (2.7), 


[F(x, у) € KA + [xl + ly)? Uy’. (2.16) 


Then for all t > О we have 
p 
У(Е); — Y(F). (2.17) 
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2.4 An Unbiased Central Limit Theorem 


At this stage, we can set, with the notation (2.11), (2.12) and (2.15), and also (2.9) 
and (2.13) for G and G’: 


kn An 


2 
3 
EA (5 AP ar’) — kn An V(G — 67)". 


V(gy = У(д) + (JED + 9 © лт) 


We then have the following theorem, which is a straightforward consequence of 
the three previous theorems and of k,./A, — 9, plus (2.10) and (2.11) and the fact 
that the function С — G” satisfies (2.16) when g satisfies (2.6). 


Theorem 2.7 Under the assumptions of Theorem 2.2, and with Z as in this theorem, 
for all t > 0 we have the finite-dimensional stable convergence in law 


1 
VAn 


Note that 0 no longer explicitly appears in this statement, so one can replace (2.3) 
by the weaker statement 


= Lfs 
(Vig)i — V(g)) — 2. 


(this is easily seen by taking subsequences n; such that ky, VAn converge to an 
arbitrary limit in (0, оо)). 

Itis simple to make this CLT "feasible", that is, usable in practice for determining 
a confidence interval for V (g); at any time > 0. Indeed, we can define the following 
function on М]: 


а 
A(x) = OY Oj G(X) дһ g(x) (x^ x" + xim), 
J.k.lm-l1 


u.c.p. 


We then have V (h)? = У (Л), where V(h), is the right hand side of (2.8). Then 
we readily deduce: 


Corollary 2.8 Under the assumptions of the previous theorem, for any t > 0 we 
have the following stable convergence in law, where Y is an N (О, 1) variable: 


V(I —V(gh с-. 


= > Y, in restriction to the set {V(h); > 0]. 
y An V OD? 
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Finally, let us mention that the estimators V (g)” enjoy exactly the same asymptotic 
efficiency properties as the estimators in [5], and we refer to this paper for a discussion 
of this topic. 


Example 2.9 (Quarticity) Suppose d — 1 and take g(x) — x?, so we want tho 
estimate the quarticity [^ с? ds. In this case we have 


A(x) = 2х2,  G(x, y) – G(x, y) = 0. 
Then the “optimal” estimator for the quarticity is 


[t/An]—kn+1 [t/ A5] —2ks 4-1 


3 ^ 
Rime) Oe Gye 2, Xu 
& і=1 i=l 
(kn — 1)A 
tj — (€D laua )- 


The asymptotic variance is 8 /, i rod ds, to be compared with the asymptotic variance 


of the more usual estimator 3A. SUM (A? X)*, which is X Js c: ds. 


3 Proofs 


3.1 Preliminaries 


According to the localization Lemma 4.4.9 of [4] (for the assumption (K) in that 
lemma), it is enough to show all four Theorems 2.2, 2.5-2.7 under the following 
stronger assumption. 


Assumption (SA'-r): We have (A'-r). Moreover, we have for a A-integrable function 
J on Е and a constant А: 


DI, Ib 1511, Well. 111, 1271, J < А, 
1500, t, DI x ЛС), Go t ОЦ? < JG). (3.1) 


In the sequel, we thus suppose that X satisfies (SA’-r), and also that (2.3) holds: 
these assumptions are typically not recalled. Below, all constants are denoted by К, 
and they vary from line to line. They may implicitly depend on the process X (usually 
through A in (3.1)). When they depend on an additional parameter p, we write Kp. 

We will usually replace the discontinuous process X by the continuous process 


t t 
ы! bas | os dW;, (3.2) 
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connected with X by X, = Xo + X; + bam АХ. Note that b’ is bounded, and 
without loss of generality we will use below its càdlàg version. Note also that, since 
the jumps of c are bounded, one can rewrite (2.2) as 


t t 2 
Ct =в+ | һа | zaw, + | (КЁ (и — v)(ds, dz). 
0 0 0 JE 


This amounts to replacing Ё in (2.2) by bj, + fp 504, z) (KÒ, 21D 
= lae pip A(dz), where & is a continuous function with compact support, 
equal to | on the set [0, A]. Note that the new process b is bounded càdlàg. 

With any process Z we associate the variables 


п(2), = /Е(зир,е чы Zito — Zell? 12), (3.3) 
and we recall Lemma 4.2 of [5]: 


Lemma 3.1 For all t > 0, all bounded càdlàg processes Z, and all sequences 


Un = О of real numbers tending to 0, we have Ay i ( yo (2) 1л) — 0, 


and for all 0 € v < s we have Е(П(2) 5 | Fr) < 102): 


3.2 An Auxiliary Result on Itó Semimartingales 


In this subsection we give some simple estimates for a d-dimensional semimartingale 


t t t 
hey bass | e! aw, + | J 20 - os a 
0 0 0 JE 


on some space (Q2, F, (Fi)r>0, P), which may be different from the one on which 
X is defined, as well as W and џ, but we still suppose that the intensity measure v is 
the same. Note that Yo = 0 here. We assume that for some constant А and function 


JY we have, with c! = o” g**: 


1071 < A, |с! < А2, [157 (o, t D1? x J' (o) < A’, ] «2 < А2, 
(3.4) 


The compensator of the quadratic variation of Y is of the form [A сї ds, where 
a = cY + Iz SY (t, z) 6! (t, z)* A(dz). Moreover, if the process с! is itself an Itó 
semimartingale, the quadratic covariation of the continuous martingale parts of Y 
and c” is also of the form IH 24 ds for some process T'Y , necessarily bounded if both 
Y and c” satisfy (3.4) (and, if Y = X, we have c = c and Y = 2). 
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Lemma 3.2 Below we assume (3.4), and the constant К only depends on A. 
(a) We have for t € [0, 1]: 


XY] Y? | Fo) — top?" | < Kt 4- Vin" )o + n©@ ot) < Kt, 


|E(Y, | Fo) — tbt | < tn(b o, < Kt 


(3.5) 


and if further || Ner = eb | Foll < А27 for all t, we also have 


zY/ Y" | Fo) — tay ^" | 2872 QA? t + Anbo) < Kt”. (3.6) 


(b) When Y is continuous, and if wiles — 65 Il? | Fo) < A*t for all t, we have 


IE Y, jk Y. jl Y,j 
z(v/ YF Y] v | Fo) — tco” су" +o” a + | < Kt. qu 


(c) When сї is a (possibly discontinuous) semimartingale satisfying the same 
conditions (3.4) as Y, and if Y itself is continuous, we have 


|Е((Ү/ YE — ter cn —су'"у | Fp) | < KPPWE+ NC o). G8) 


Proof The first part of (3.5) follows by taking the Fo-conditional expectation in the 
decomposition Y, = M; + tb% + Jo (bY — be ) ds, where M is a d-dimensional 
martingale with Mo = 0. For the second part, we deduce from Itó's formula that 
Y Y" is the sum of a martingale vanishing at 0 and of 


otl AN t . А Do. 
н vi" ds + | rias | ү] - bias « | Y; (by — by) ds 
0 0 0 0 


t 
_Ү,) _Y,j ~Y,j 
+ Co Tae 1 (c, 7" — Co I™) ds. 
0 


Since Е (|У, | Fo) € K At, as in (3.9), we deduce the second part of (3.5) and also 
(3.6) by taking again the conditional expectation and by using the Cauchy-Schwarz 
inequality and the first part. 

Equation (3.7) is a part of Lemma 4.1 of [5]. For (3.8), we first observe that Y; ý: — 
tog” = B, + M, and сї!" = су" = B| + Mj, with M and M' martingales (M 
is continuous). The processes B, В’, (M, M), (M', М”) and (M, M^) are absolutely 
continuous, with densities bs, bi, hs, А” and Л” satisfying, by (3.4) for Y and сї: 


АЕРА bZ | + |с} = col ILS К, Ih < ТЛЕ КЕЗ 


where h” = УС" 4 үй Үт Again as in (3.9) below, E(J|Y;]4 | Fo) < 
K,t4/? for all q, and E(\|cY — с} ||? | Fo) < Kt. This yields E(B? | Fo) < Kt? and 
(М2 | Fy) < Kt?. Since |B;| < Kt and E(M? | Fo) < Kt, we deduce that the 
Fo- conditional expectations of B; B;, B; М; and М; B, are smaller than К 12. 
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Finally E(M;M; | Fo) = E((M, M^) | Fo), and (M, M^); is the sum of 
a" |) YJ ds + I^ Үү; (СЗ ы e "уаз and a similar term with k and j 
exchanged. Then using again E(|Y;|? | Fo) < Kt, plus |E; | Foll < Kt 
and Cauchy-Schwarz inequality, we obtain that the above conditional expectation is 
smaller than K (t? + tT. This completes the proof of (3.8). 


3.3 Some Estimates 


(1) We begin with well known estimates for X' and c, under (3.1) and for s, > 0 
and q > 0, see [4] for details: 


(Бире ПХ, — ХД | Fe) < Ка 54/2, ЕС — X; l Foll < Ks 
© ( SU, eqo.s] [сро — cil? | Fi) < Kg s1^(a/2). | E(cr4s — cr | 75) < Ks. 
(3.9) 


Next, it is much easier (although unfeasible in practice) to replace C7 in (2.5) by 
the estimators based on the process X’ given by (3.2). Namely, we will replace c7 


by the following: 
kn—1 


ET 
с" = ER > At, X An, X". 
cm F 


The difference between c? and С, is estimated by the following inequality, valid 
when и, x A7 апда > 1, and where an denotes a sequence of numbers (depending 
оп un), going to 0 as n — со (this is Eq. 4.8 of [5]): 


Dc — ©" 14) < АЕА, (3.10) 


(2) The jumps of c also potentially cause troubles. So we will eliminate the “big” 
jumps as follows. For any р > 0 we consider the subset Е, = {z : J (z) > pj, which 
satisfies \(E,) < oo, and we denote by б? the o-field generated by the variables 
и@0, t] x A), where t > 0 and A runs through all Borel subsets of Е. The process 


N? = (00, t] x Ep) (3.11) 


is a Poisson process and we let S}, Sb. ... be its successive jump times, and ©, ‚р 
be the set on which 57 € {i^n : i > 1) for all j > 1 such that si « t, and 


БИ 4p >ta 8^ + (6k, i ПА, for all j > 0 (with the convention S$ = = 0: taking 6k, 
hore instead of the more natural k, will be needed in the proof of Theorem 2.6, and 
makes no difference here). All these objects are G’-measurable, and P(Qy у) — 1 
as n — oo, for all t, p > 0. 
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We define the processes 


b(p): = Б, — | Ò+, z) Жа), (р), =H GF + / Òl, z) (t+, z)* A(dz) 
Ep (Е) 
с(р): = ci - fo Je, (s, z) w(ds,dz) = c (p); +c (p), where 
c (D (g), = со + fo b(p)s ds + fy Fs dW, (3.12) 
cO (р), = fo fig y FE- 2) (и — vds, dz), 


so c(p), which is R? i & RI : -valued, is the càdlàg version of the density of the pre- 
dictable quadratic variation of c(p). Moreover G^ = {0, Q} and (b(p), c(p)) — (b, с) 
when p exceeds the bound of the function J. Note also that b(p) and c(p) are càdlàg. 

By Lemma 2.1.5 and Proposition 2.1.10 in [4] applied to each components of X’ 
and c® (р), plus the property ||b(p)|| < K/p, for allt > 0, s € [0, 1], p € (0, 1], 
а > 2, we have 


х ( ирер, ПХ — XII | F; V GP) < Kg 54? 
E(Xt45 — X; | Fs v GY + || Е(с(о), — c(9) | Fs v 6?) < Ks 
Z( up, eto, llc itu — cO ONI | Fi v G^) < Kg bp G + 84/7) 


* ( Sup, ero.] llc Co)i-ew n c(p): |l? | Fi У g^) < Kg (bps + 54/2 + т) < Кар s. 


(3.13) 
where à, = Jex J(z) A(dz) — 0 as p — 0. Note also that (ЖОЛ x K/p. 
(3) For convenience, we put 
b? = bG-1)Ay> Cj = Ci) A, 
b(p)? = bai-a СОО)? = COG- CO; = €(—)G-Na,> (3.14) 
F? = Ға-рль Р = FRY GP. 


АП the above variables аге ge -measurable. Recalling (3.3), and writing 
n(Z, (ШЧ): if we use the filtration (H+) instead of (F;), we also set 


np); = max(n(¥, (0° V. Fit) an. jas : Y = b', bp), с,с(р), €), 
MP); = NO о, · 
Therefore, Lemma 3.1 yields for all t, р > O and j, К such that у + k < 2К„: 
[t/An] 


AnE( >) nu) 0. EMOA l FEO my. (3.15) 


i=1 
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We still need some additional notation. First, define G°-measurable (random) set 
of integers: 


L(n,p) = {i = 1,2,...: М 2h) Ay — NGA, = 0} (3.16) 


(taking above 2k, instead of k, is necessary for the proof of Theorem 2.5). 
Observe that 


ieL(n,p) 0x < 26 +1 = Cj с? = соу) = с(р)". (3.17) 
Second, we define the following R4 & R¢-valued variables 
ой = AT X! AP X'* — cP A, 


k,—1 
BP =" — cP = р Уо (оү) + Ly – An) (3.18) 


n. n __<уп _ gn n n 
V =, TO = Bite, TO 


(4) Now we proceed with estimates. (3.13) yields, for all q > 0: 


(о I | E^) < Kg AR, zo? | £^ < КАЛУ”, 


` “Кп 3 E 
(Sat е ena. epe E) ek 


(3.19) 


the third inequality following from the first two ones, plus Burkholder-Gundy and 
Hólder inequalities, and the last inequality from the third one and the boundedness 
of c;. Moreover, since the set {i € L(n, p)} is G?-measurable, the last part of (3.13), 
(3.17), and Hólder's inequality, readily yield 


4/2 
422, іє1І(п,р) > Eli? 15") < Ka (V^ An фо + Ай + =). 
(3.20) 


(5) The previous estimates are not enough for us. We will apply the estimates of 
Lemma 3.2 with Y, = X; рд ,, — X@—1a, for any given pair n, i, and with the 
filtration (Fa-1)a,+t V G’):>0. We observe that on the set A(p, п, i) = {Aj € 2k, : 
i — j € L(n, p)}, which is G?-measurable, and because of (3.17), the process сї 
coincides with с(р) (1л, — (0) —1)л„ i£ Є 10, An]. Then in restriction to this 


set, by (3.6) апа (3.7) and by the definition of (р)? |, we have 


| RAP X Anm | F’) _ crm wl < КАУ? K+ п(0)" \) 
| (Ат X") A? X’ ATE" ATUM | г?) 
_ ОГАШ dE сті unm + pose < K An 


1 1 t 
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(the constant above depends on p, through the bound K/p for the drift of c(p)). 
Then a simple calculation gives us 


xo? | Z7 ^| < К po (LA n + 100)? 1) 
(a^ Jta nin |” P) (с п, Лып, km +e jm суд? | < 2 K, A 


1 А 1 


оп А(р, n, i). 


(3.21) 


Next, we apply Lemma 3.2 to the process Y; = c(p)(i 1)л, +4 — С(0) (1л, for 
any given pair n, i, and with the filtration (7; рл, +. V G?);-o. We then deduce 
from (3.5), plus again (3.17), that 


ieL(np,0ztzkAs— 
(с pase Сб DAS удани Qna FF) = tea | 
< Кы n); k, 
|E(cG—1) ante — са-1)л„ | 47) — 2600)? | < Куто)", < Kpt. 


(3.22) 


Moreover, the Cauchy-Schwarz inequality and (3.19) on the one hand, and (3.8) 


applied with the process Y; — Xt DA dt — Ха aA, on the other hand, give us 


|E(o дару" | Fi?) | < K Aun? 


a/n nkl A nums n.p 3/2 5 (3.23) 
|E(o7 ^ Алс" | Р) | € Kp An’ (VAn (GO |). 


i € L(n, p) >| 


(6) We now proceed to estimates on В? А 


Lemma 3.3 We have on the set where i belongs to L(n, р): 


n.jk 4n,lm n, n,jl n,km n,jm n,kl „Ак = п,]К1т 
um АЕ (С & 4574 deci | 


i i i 
1/4 
S Ko An (An (p) 
оп, jk , nl Jl ; „Аһ = „jkl 
EGP (e стт) FPO — Eie p] < Ko An Q/ Aa + 0(QYD- 


Proof We set G^; = o5, + (сууу — cj) An and write Zee as 


kn—1 kn—2 kn—1 kn—2 Ка—1 
n, ike n, Ina n, ike n, “ane n,lm eg D 
M Chu toa У) 255 Gio tg У) NES ` 
n nh y=0 nh hn y=0 v=u+1 n^n y=0 v=u+l 
(3.24) 
For the estimates below, we implicitly assumei € Г (п, р) апаи, v € (0,..., k,—1]. 


First, we deduce from (3.21) and (3.22), plus (3.23) and successive condition- 
ing, that 
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„и 1 1 


| и lm | F” P) (с) п, Лет km P jm anys < KAP. (3.25) 


Second, if u « v, the same type of arguments and the boundedness of b(p) гара с; 
yield 


SU. FRED — бр 67 An — (ор Ado — и — 0| 
n (Kns An TLLA 1) 
Ма, (opui E N 0) < KpAn? An +0...) 

а te =g] СЯБ < KAR (VAn ua 1) 

nen Gp, — Бо ЛЕ) | E < Kos Qf Ag HNO „ |) 
Xo" Брут | FREY) < КЪА G/ As HNO 1) 

Ne eum "ma — (5 | FEP) — (gy Anul < KpAnn(p)} 
du Lp a (FP EAST. 


ru 


= KA; 


Since у? Y н и = k3/6 + O(k2), we easily deduce that the F;”” -conditional 


expectation of the last term in (3.24) is е c(p); i ume An, up to a remainder term 

which is O(/A SATA + (o). and the same is obviously true for the second 

term. The first claim of the lemma readily follows from this and (3.24) and (3.25). 
The proof of the second claim is similar. Indeed, we have 


k,—1 
n.jk, n,lm nlm, __ 1 n,jk n,jk n, jk n,lm nlm 
B" (eu GO = LA > ‚(а Pie, G )An) (c, — 6 ) 
n^n 
и=0 


апа 


n,lm c im 
(с Cit ky T | Fi 


< K Anno)" 


n,lm n,lm nlm 
ete) Citai T Ci -b( О uai Ап (а — u — D| 


i+u+l,kn—u' 


Using the previous estimates, we conclude as for the first claim. 


Finally, we deduce the following two estimates on the variables у” of (3.18), for 
any g > 2: 


+(_n,jk | n,lm n.p 2 n,jl n,km n,jm n,kl 
(Ua FP = RG dq +e) 


ieL(n,p = 2А (py а ge Ag oe 
4 
Slag ll | FM) < K Rs bp + A4 + 5). 


(3.26) 


To see that the first claim holds, one expands the product ^7 alk uem and uses suc- 


cessive conditioning, the Cauchy-Schwarz inequality and (3.13), (3.17) and (3.22), 
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апа Lemma 3.3; the contributing terms are 


jk jl jk 1 jk jk 1 А 
pm p m + 836 we 4 (c qk — en NG au 2 d mj 


n,jk , nlm n,lm n,lm , n, jk n,jk 
S (ск -e )—' (Ср 8" ). 


For the second claim we use (3.13), (3.17) and (3.20), and it holds for all q > 2. 


3.4 The Behavior of Some Functionals of c(p) 


For p > 0, we set 


п иа 3 21 п $ 
Ule» IAM ш> шуа}, Where (3.27) 


up = E Yes суы — С(0)( y, +): 
The aim of this subsection is to prove the following lemma. 


Lemma 3.4 Under (SA'-r) and (2.14) we have 


lim limsup E(U(p);) = 0 


p>0 поо 


Assumption (SA’-r) is of course not fully used. One only needs the assumptions 
concerning the process c;. 


Proof With the notation (3.12), and for 1 = 1,2 we define pu (р)" апа U (p)? 
as above, upon substituting c(p) and w/,/4 with c® (р) and и” im Since U (р)! < 
4U (py +400) (о), it suffices to prove the result for each AC) (p); . 


First, le? (54? 1 {ll (o? | >и, /8) is smaller than Ke о)" /u'?, whereas 


n? 


(recalling IDOI < К /р) classical estimates yield (ILU D о)" | 4) < < KAg(1 + 
"(14+ An/p), yielding 


А» / p). Thus the expectation of U ® (p)? t is less than К A; 
the result for U(? (p). 


Secondly, we have UO (p)? < У ^"! [9 (py? |l? and the first part of (3.13) 


yields E(lu? (p)? ||?) < Кф VAn. Since 6 > 0 аз p — 0, the result for U ® (p)? 
follows. 


3.5 A Basic Decomposition 


We start the proof of Theorem 2.2 by giving a decomposition of V (g)" — V (д), with 
quite a few terms. It is based on the key property C = c? + 87 and on the definition 
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(3.18) of a? and 37. A simple calculation shows that JE (V(g)? — V(g) = 


x 1 y , as soon as t > Kk, ^5, where (the sums on components below always 
extend from 1 to d): 


[t/An]—kn+1 
улл у, (9) — 9@")) 


1 [t/An]—kn+1 


ү”? _ 
Л = 
An 


iAn 
f (g(c?) — д(с;)) ds 
i=l @—1)А„ 

1 [t/An]—kn+1 kn—1 
үз = д, g(c?) от!" 


I,m 


[t/An]-kn+1 kn-1 
n,4 м An n n,lm nlm 
V = > > Omg (C ) > (C; — Cj ) 
u-l 


kn i=l l,m 
1 t 
=F ges) ds 
VAn JA АД 
[t/ Aj] —k, 9-1 
l 
VP = УА, У (gc? +88) – 96D — > Amg(c?) BF"). 
i=l l,m 


The leading term is V3. the bias comes from the terms У" and V", and the 
first two terms are negligible, in the sense that they satisfy 


j=12 > Vio forallt » 0. (3.28) 


We end this subsection with the proof of (3.28). 
The case j = 1: (2.6) implies 


lg) — 90") x KA + ler + le"?! Ire? — ez 


І 
< KA + Ie e — e+ Кё — a M. 


Recalling the last part of (3.19), we deduce from (3.10), together with the fact that 
1-го – p(l — 20) < Soe for all g > 1 small enough and Hólder's inequality 


that E(|g@") — g(@")|) € Ka, ДР). Therefore 


(sup Г) < кеа, Az" t7 
st 


and (3.28) for j — 1 follows. 
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The case j — 2: Since g is C? and ‚С is ап Itó semimartingale with bounded char- 
acteristics, the convergence V^? 5 0 is well known: see for example the proof of 
(5.3.24) in [4], in which one replaces pe, (Ў) by g(cs). 


3.6 The Leading Term V":? 


Our aim here is to prove that 


y^3 £3 у (3.29) 


(functional stable convergence in law), where Z is the process defined in Theorem 2.2. 
A change of order of summation allows us to rewrite У" as 


[t/An] 1 G—1)^(k,—1) 
l 1 1 
үз = > Уи nlm al т. where ш", m m E page ;). 
"о i=l Lm ” j=(i—[t/An]+kn—1)* 


Observe that w; and а? are measurable with respect to F;’ and 7^ |, respectively, so 
by Theorem IX.7.28 of [3] (with G — 0 and Z — O in the notation of that theorem) 
it suffices to prove the following four convergences in probability, for all t > 0 and 
all component indices: 


1 [t/An]—kn+1 Р 
nlm, nlm n 
—— У wh Eh" | F") 0 (3.30) 
VAn i=l 
1 [t/An]-kn+1 . ^ 
a. È шр ep" EQ? ar | FP) 
n ja 
P f. 
+, | Ojk.g(cs) дһ (сз) (cl ск" + сі" c) ds (3.31) 
0 
[t/An]—kn+1 Р 
= >, [ш Ее Fa — 0 (3.32) 
п i=l 
1 [t/An]—kn+1 P 
У wh Eat! APN | FP) — 0, (3.33) 


where N = W/ for some j, or is an arbitrary bounded martingale, orthogonal to W. 
For proving these properties, we pick a p bigger than the upper bound of the 
function J, so С? becomes the trivial o-field and F? = pU and L (n, р) = N. In such 


580 J. Jacod and M. Rosenbaum 


a way, we can apply all estimates of the previous subsections with the conditioning 
o-fields 77. Therefore (3.19) and the property ||w?|| < К readily imply (3.30) and 
(3.32). In view of the form of a, a usual argument (see e.g. [4]) shows that in fact 
jam A" N | £7) = 0 for all N as above, hence (3.33) holds. 

For (3.31), by (3.21) it suffices to prove that 


[t/An]-kn+1 
n,jk || піт , n,jl n,km n,jm n,kl 
An ш” wy; (с; с; TG" od) 
i-l 


Р d p 
Z, | опас) дь) ccn + ei") аз 
0 


jk 

: i i» t л ie, t) => 
с] * almost surely if |i (n, t) A, — t| € kn A, (recall that c is almost surely continuous 
att, for any fixed t), and the above convergence follows by the dominated convergence 


theorem, thus ending the proof of (3.29). 


In view of the definition of ш" , for each t we have ш" —> д; kg (Cr) and с" 


3.7 The Term У" 


In this subsection we prove that, for all г, 


d 
yri E. КЛ | Om g(cs ) dc!" — 0 (с). (3.34) 
l,m 


We call y^ and pou respectively, the first sum, and the last integral, in the 
definition of we Since k, / ^, — 0 and c is a.s. continuous at f, it is obvious that 
АШЫ converges almost surely to —0 g(c;), and it remains to prove the convergence 
of АШ to the first term in the right side of (3.34). 


n n __ yu-lan 
We first observe that c?,,, — с? = = 0 Ат+ь 


т, d 


c. Then, upon changing the order 
of summation, we can аў у, 


[t/ ^n]—1 


үт = 5 ^w nlm А" іт 


i=l l,m 


G-DAKn-2) 
JĀ 
pe 8 > (kn — 1 — u)O1mg(ch_,)- 


Kn | 
u=0V (i--k, —1—[t/ ^5]) 


In other words, recalling kıy А, < К and ||Og(c;)|| < K, we see that 
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jou 228 H(n, a dcl", 


l,m 


where H (n, t), is аа x d-dimensional predictable process, bounded uniformly (in 
n, з, ш) and given on the set [kn An, t — kn An] by 


kn—2 
An n 
ER > ds —1—и)дмд(с]_„) 


и=0 


@—1)А„ «s <А, > H(n t)” = 


(its expression on [0, kn An) and on (t—k, An, t] is more compres but not needed, 
apart from the fact that it is uniformly bounded). Now, since X 2 ? (ks —1—и)= 

k2/2 -FO(Kk4) as n — оо, we observe that H (n, pp converges to 5 ? Or g(cs—) for all 
s € (0, t). Since c is a.s. continuous at t, we deduce from the dominated convergence 
theorem for stochastic integrals that V, ' má indeed converges in probability to the first 
term in the right side of (3.34). 


3.8 The Term У" 


The aim of this subsection is to prove the convergence 


p 
VA) = 24; 9$. fi (g(cs— + wAcs) — (e) — w У Anges) Асу") dw. 


s<t l,m 
(3.35) 
We have V" = >); Hm аш өр, where 
= VAn (get + BP) — 91) — > Om (c) 8”). 
I,m 
We also set 
— 1 Ка—1 ==] 1 kn—1 
а? = Kn An рана =0 Ow Br = B; n а? = = Ky &u=1 (сы = ej), 
a = / Nn (g(c? + В") ze (cr) > Ва) Өт"), ш? = v? _ v", 
‚т 
(3.36) 


We take p € (0, 1], and will eventually let it go to 0. With the sets L(n, p) of 
(3.16), we associate 


L(n, p,t) = (, .... [t/An] — kn + 19 L(n, p) 
L(n, p, t) = {1,...,[t/An] — kn + 1)\ (n, p). 


582 J. Jacod and M. Rosenbaum 


We split the sum giving уз into three terms: 
pn = * 0", и? = >, v, pose _ >. ul”, (3.37) 
ieL(n,p,t) ieL(n,p,t) ieL(n,p,t) 

(A) The processes 7. A Taylor expansion and (2.6) give us 
VDE = Ут" ы» jas aep) EGET Be” | FPP) 

v? = v(D7 + 002) -v(3)7, where ue УА X ps ни) gri ge - v(1)" 
WOFI € КУА, (1 + L8? 4077 IMP. 

Therefore 


3 
и"? =X UG”, where U(j)? = A» v(j)}. (3.38) 


j=l ieL(n,p.t) 


On the one hand, letting 


1 i i Я kny А j 
и 2 4 n,jl n,km n,jm n,kl n n —, An,jklm 
w(p) = 2 Fe mI) Ea AI ip с" pe ue yr -p A ), 
j,k,l,m 
the càdlàg property of c and с(р) and kn y An — 0 imply 
[I / As ]—ks H1 Р 6 t ikl 

(у = An D WOSu = A У [| O5, img (es) 60р)!" ds. 

E . 0 

i=l j,k,l,m 


On the other hand, Lemma 3.3 yields |v(1)” — Anw(p)""| < Kp An (АЛ + n(p)”) 


when i € L(n, p), whereas |w(p)7| < K always. Therefore 


[t/An] 
(Ша - wor) s KpAnE( У VAn +O) + КАЕ CLO, p. 0)). 
1=1 


Now, #(L(n, p, t)) is not bigger than (2k; + DN”, implying that A, E(#(L’(n, p, t))) 
< Ку An. Taking advantage of (3.15), we deduce that the above expectation goes 
to 0 as n — oo, and thus 


uy? E vay. (3.39) 


Next, v(2)? is B -measurable, with vanishing gm -conditional expectation, 


and each set {i € L(n, p)} is ES '? measurable. It follows that 
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(U^) x 2k, B| Seiad (002) |2 | ғ")) 
< Khan E( Zier BU | FP) < кофр кау, 


where we have applied (3.20) for the last inequality. Another application of the same 


estimate gives us 
/4 


(UGD < Кеф + Kpt An”. 


These two results and the property ¢, — 0 as p — O clearly imply 


lim lim sup Е( (2); ^| + [U(3); ^|) = 0. (3.40) 


р? n— oo 


(В) The processes U"'?. We will use here the jump times 5f, S$, .. . of the Poisson 
process N^, and will restrict our attention to the set ©, у, p defined before (3.12), 
whose probability goes to 1 as n — oo. On this set, L(n, p, t) is the collection of 
all integers i which are between [52 / А„] — 2k, + 2 and [55/ Aa] + 1, for some q 
between 1 and №. Thus 


NP [52 /А 1+1 
ule = = н, p.q), where H(n, р, 4) = >, v^. (341) 
а=1 i=[S4/An]—2kn+1 


The behavior of each Н (n, p, q) is a pathwise question. We fix q and set 5 = 87 and 
аһ = [S/ ^4], so 5 > а, A^, because 5 is not a multiple of A,,. For further reference 
we consider a case slightly more general than strictly needed here. We have с” — cs_ 
when a, — 6k, + 1 < i < an + 1 and c? — cs when a, + 2 < i < an + 6kn, 
uniformly in i (for each given outcome ш). Hence 


(kn — an +i — 2) A (Kn — 1) 


Э! п 
8; is Acs > 0 
uniformly ini € {an — 6k, + 2, ..., an + 5К.). (3.42) 
Thus, the following convergence holds, uniform in i € {an — 2k; + 1, ..., an + 1}: 


Jas v (s(cs- + boe nd Acs) g(cs_) 
" _ " 
— dim дт9(с5_) (a + i-e Аат) ) xlii 
which implies 


k,—3 


Hin, p. d) -VAn >; (9(с5,- 


и 
Tu Acs,) — 9(с,-) — > Amgcs,— pam) 0 


I,m 


and by Riemann integration this yields 
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: 1 
Н(п,р,4) > 8 | (ales, + wAcs,) — g(es,-) — w Y Amg(cs,-) Асу”) dw. 
0 l,m 
Henceforth, we have 


N? a 
P 
uu = > | (g(cs,— + wcs,) — g(cs,—) —w > Amg(cs,-) Ас) аш. 
q=1 l,m 
(3.43) 


(C) The processes U”™’. Since |8" < К we deduce from (2.6) that |v/"| < 
К/А, (paz || + a? |Р). (3.19) yields EQfa? 4 | £7) < K4 A4 for all q > 0. 
Therefore 


^ З 3/4 т rE 1/4 
(10|) < КАЕТ, n, p, t) € KA, 


by virtue of what precedes (3.39). We then deduce 


uee B o. (3.44) 


(D) Proof of (3.35). On the one hand, V^? = U(1)'^? + U(2)'? + U(3)'? + 
UP? + UP: on the other hand, the dominated convergence theorem (observe that 


©(ру, — 92 for all г) yields that U (1 — A? — 1 A? and 


1 
ву | (g(cs— + wAcs) — g(c,—) — w S Op g(cs-) Ас) dw 
0 


S<t I,m 


as p — 0 (for the latter convergence, note that lga +y) =g) >» Oi gx) y| < 


K ||y 2 when x, y stay inacompact set). Then the property (3.35) follows from (3.39), 
(3.40), (3.43) and (3.44). 


(E) Proof of Theorem 2.2. We are now ready to prove Theorem 2.2. Recall that 
JE (V(g)n; = V(g)) = 2 x Үү”, By virtue of (3.28), (3.29), (3.34), (3.35), itis 
enough to check that 


Aj А; + At Aj = $ Dim Jo Omg(cs-) dey” — 0 gler) 
— 2A? + B X у=: Jr (g(cs— + wAcs) — g(¢s—)—w Lin Omg (Cs—) Ne”) dw. 


To this aim, we observe that Itó's formula gives us 


t 
6 
(сг) = g(co) + уз) Әна ас!" = 5^ +} (gles— + Acs) — g(6-) 


l,m s<t 


= >, Omg (Cs—) Aci), 


l,m 
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so the desired equality is immediate (use also is wdw = 1), апа the proof of 
Theorem 2.2 is complete. 


3.9 Proof of Theorem 2.5 


The proof of Theorem 2.5 follows the same line as in Sect. 3.8, and we begin with 
an auxiliary step. 


Step (1) Replacing ©! by €". The summands in the definition (2.12) of A"? are 
p (1) Replacing c; by c; t 


RE, 05,4, ), where Ri, y) = Dj etm Fe 90007 — x/*)(yl" — х"), and 
we set 


[t/An]—2kn +1 
Аз yAn 5 


In Уп 
t 8 RCF > Cite) 


i=l 
We prove here that 


АЗ ar В, 0 (3.45) 


for all t, and this is done as in the step j = 1 in Sect. 3.5. The function R is C! on 
R2 with |9/ R(x, y)l < KQ + П + УР) for j = 0, 1, by (2.6). Thus 


IRC}. С ы) — RE". G4) 
< KA + IG? + ley WD? Cle? — e? H Wey, — 217, ID 
+ Ке? — 2" + Kiley, — 2" |. 
Then, exactly as in the case afore-mentioned, we conclude (3.45), and it remains to 
prove that, for all t, we have 


р 1 
АЗ > — л А? + Аў АЙ. 


Step (2) From now on we use the same notation as in Sect. 3.8, although they denote 
different variables or processes. For any p € (0, 1] we have A"? = U™P + U™P + 
U""-P. as defined in (3.37), but with 


п VAn n n „п n 

vp = р" RE + GF, Cig, t BL) 

т |. y An n эп „п эп In Qn т 
v" = —Yg* К(с + Вр, с + Вір), v = UF Y. 


Recalling ^7 in (3.18), ће decomposition (3.38) holds with 
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^n ч „jk Я 

v(1); = „м Хв Ki кп 9 (67) EC? qp Lee) 
jk 

v2)! = _уы aT -PR ger) rd ут!" - v0)" 

v3! = v! — v1)! — vy". 


Use С" — c? = B? and (2.6) and a Taylor expansion to check that 


WOI x Ky An 152121691 0 + 18D. 


We also have |v(2)7] < KA/ An Ily? |2, hence (3.20) and (3.26) yield 


An 
z(|v3)7] | g^) --EQvQY'? | GP) < K^, (o, dA zi 
p? 


and thus (3.40) holds here as well, by the same argument. Moreover, (3.26) again 
yields (3.39), with now 


E » ЖОО ( etn" + s elle i + ci") ds 


j,k,l,m 


This goes to A? — 1 A? as p — 0. 
Another application of (2.6) gives us 


jo) < KAn (1 + 15012) (Iro + Mu, 1 + Па? + Wa, 1). 


Then another application of (3.19), (3.20) and (3.26) yields E(v/"| | 07) < КА 
and we conclude (3.44) as previously. We are thus left to prove that 


р>0 = UP’ uP, wih,asp— 0, Ul" E Af. (3.46) 


Step (3) On the set €2, ‚р we have (3.41) and we study Н (п, p, q), in the same way 
as before, on the set 9, ‚р. We fix q and set 5 = S; anda, = [S/ An]. We then apply 
(3.42) andalso c? — cs. orc? — cs,accordingto whether a, —2k, +1 <i € an+1 
OF dn +2 € i < aj + ks, to obtain 0" —V? — 0, uniformly in i between an — 2kn + 1 
апаа, + 1, where 


0 if аһ — 2k, uc MO 
2k, —as-Hi 2 VAn 
{ : Wc ум 2d д, 1т g(cs— ) д Аст 
v= if an — 2k,» +3 < i < аһ = К +1 


(an—i+2)?./A 2 k,—ag-Hi 4-2 Jk 4 Am 
: 8k2 = 2 an дыт 9465 m A Acs Асу Асу 
ifa, — kr +2<i<a,+1. 
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We then deduce, by Riemann integration, that 


0 І т. 
H(n, p,q) > -3 5 ] Pran aes 95 es 01. Ac) w? Асу) Асу! dw, 
j,k,l,m 


P 
which is 0G'(cs, -, Acs,), hence the first part of (3.46), with О = gy G' 
(c 50, Ас 5). The second part of (3.46) follows from the dominated convergence 
theorem, and the proof of Theorem 2.5 is complete. 


3.10 Proof of Theorem 2.6 


The proof is once more somewhat similar to the proof of Sect. 3.8, although the way 
we replace C? by C" and further by а” + 3? is different. 


(A) Preliminaries. The jth summand in (2.15) involves several estimators C", span- 
ning the time interval ((j — 3)k, An, (jJ + 2)k4 An]. It is thus convenient to replace 
the sets L(n, p), L(n, p, t) and Lin, р, t), for p, t > 0, by the following ones: 

L'(n, p) = U = 3,4,...: No pay, 7 М-З, = 0l 
L'(n, р, 1) = {3, жу [t/kn An] = 3} n L'(n, p) 
L'(n, p, t) = (3, «5 [t/Kn An] — 3) 0 (AL (n, p)). 


For any p є (0, 1] we write V(F)? = V^? + YP where 


= F( 3, 41:956) Ци" гелме", ү] уш, <Ne 


np yep i n 
у = View, p.t) vj Y m = ерп, Ui. 
We also set 
Snc! | yn _ an по _ оп n 
Ja Ciber = 60-21 058 = Въ T Bray er 


= rd J, = 2 
ш} m 2-4 lC my, 200 @ 1 m)kn+ ull. b г = (1 + Ile [E 3)kn +10 (d+ 19721) Ё 


Equation (3.10) and the last part of (3.19) yield 


qz1 > (изу КА НЕМЕ,  E(w"y)sK,. (3447) 


Observe that 5c’ is analogous to ^7, with a doubled time lag, so it satisfies a version 
of (3.26) and, for q > 2, we have 


q/2 
. D 4 An 
i e (п, p) = E(Il95€ | Fee oy н) S Kalv An bp + FAV pt ) 


(3.48) 
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(B) The processes >”. (2.16) yields 
Il КОТ + 20 зу, P 1872112 1үгу>и„) + 11321”. 
Thus a (tedious) computation shows that, with the notation 
aj = (+ [@ cs, a DP? MOT IP цвети; 72 
j j 


a" = _ ш" (wr E (и)? + a). 


with v > 0 arbitrary, we have DA < K (a5 + бус? + а”) (with К depending on 
v). Therefore we have |Vj^| < K(B; ^ + СГ? + D”), where 


[t/ ks An] 
np n npo пу yp n __ n 
B= У an G= У, 1001", | DP = У, а. 
jeL'(n,p.t) jeL'(n,p.t) j-3 


First, (3.47) and Hólder's inequality give us (аў) < < Kg, Ay (9-0) for anyg > 1 
and v > 0, where (recalling (2.7) and (2.14) for w and w’) we have set /(qg, v) = 
i-e —(р(1 —20)Vvi-20+a0 ?). Upon a v small enough and q close 


coni to 1, and in view of (2.7), we see that /(q, v) > 5, thus implying 


SUD > 0. (3.49) 


Next, we deduce from (3.48) that 


так ( ( 2, 110/21 | g”) < K1(op + AD^ + ES 


p 
ieL'(n,p,t) p 


and thus, since p > 3, 


lim lim sup E(|C; ^|) = 0. (3.50) 


p>0 n—oo 


The analysis of В!" is more complicated. We have ove’ = 27 + 2", where 


it ee 
n an An A NE n — d. 
Zj 5 jkt 90-2) fj yp Do (ntm — C-t) 
m=] 


(recall (3.36) for a7), hence 


2 
aj S 40 + [@ s, D^ (1z; \ Це tug say + П 12 Lagen, 73! 
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It easily follows that for all A > 1, 


2P 
Bee < 16 B^" ФАР 2ВРР — pen, (rob 


where 


я a Пе? 
BP =D eropp Am, aV = +N s aD? Se, 


2 p-l 2 
aQ) = |1 Цап as aO} = WEG зур l 101. 


On the one hand, (3.19) and Hólder's inequality yield E(a(1)" | ^) < cae 


we deduce 


and, since w’ < І, 


(В?) — 9, (3.52) 


On the other hand, observe that zi = up), with the notation (3.27), and as soon 
as j € L'(n, p), so Lemma 3.4 gives us 


lim lim sup Ne) = 0. (3.53) 


p noo 


Finally, (3.13) shows that E(| de IE Ge Р оњ, ap) S Кору An for all q > 2 and 
j € L'(n, p), whereas с 3), H1 is FG 21 -measurable, so (3.13), (3.19) and 


successive conditioning yield (а (3) | GP) < Кору An. Then, again as for (3.52), 
one obtains 


(B^) < Kpt. (3.54) 


At this stage, we gather (3.49)-(3.54) and obtain, by letting first n — oo, then 
p = 0, then A — oo, that 


lim lim sup (И) = 0. (3.55) 


p—0  n—oo 


(C) The processes Y"^?. With the previous notation st and №?, and on the set 9, p 
we have 


Уг? = S: Y UC Jka Anl+j (3.56) 


т=1 ј=—2 


This is a finite sum (bounded in л for each ш). Letting S = S? for m and p fixed and 
Wn = ү DA -| EE 1. we know that for any given j є Z the variable сү ([S/kn An + Dis 1 
converge in probability to cs— if j < O and to cs if j > 0, whereas for j = 0 we 


P RE А n 
have C's jy. д] 41 7 UnCs (1 — ш„)с$——>0. This in turn implies 
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; А = РЁ 
j<0orj>2 > OLS /kn Anlt jo? 9» 


ox P ox P P 
бд, — (1 — wn) Асѕ 0, Ó[S/k, Au ]-16—9 ACS: буы An]+2 — w,Acs—0. 


By virtue of the definition of v^, and since и’ — 0 and also since w; is almost surely 
in (0, 1) and F is continuous and F(x, 0) = 0, one readily deduces that 


P F(cs_, Acs) if j = 1 
уыл» 7 n if j #1. 


Coming back to (3.56), we deduce that 
Vr? D, W i5 V Flesp_, Асур). (3.57) 


In view of (2.16), an application of the dominated convergence theorem gives y 
У(Е),. Then (2.17) follows from V(F)? = Vj? + Vi"? and (3.55) and (3.57). The 
proof of Theorem 2.6 is complete. 


Acknowledgments We are grateful to the referee for his/her very careful reading of the paper. 


References 


1. Alvarez, A., Panloup, P., Pontier, M., Savy, N.: Estimation of the instantaneous volatility. Stat. 
Inference Stoch. Process. 15, 27—59 (2010) 

2. Clément, E., Delattre, S., Gloter, A.: An infinite dimensional convolution theorem with applica- 
tions to the efficient estimation of the integrated volatility. Stoch. Process. Appl. 123, 2500-2521 
(2013) 

3. Jacod, J., Shiryaev, A.N.: Limit Theorems for Stochastic Processes, 2nd edn. Springer, Berlin 
(2003) 

4. Jacod, J., Protter, P.: Discretization of Processes. Springer, Berlin (2012) 

5. Jacod, J., Rosenbaum, M.: Quarticity and other functionals of volatility: efficient estimation. 
Ann. Stat. 41, 1462-1484 (2013) 

6. Vetter, M.: Limit theorems for bipower variation of semimartingales. Stoch. Process. Appl. 120, 
22—38 (2010) 

7. Dette, H., Podolskij, M., Vetter, M.: Estimation of integrated volatility in continuous-time finan- 
cial models with applications to goodness-of-fit testing. Scand. J. Stat. 33(2), 259-278 (2006) 


